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1 Introduction 



Let P be a Markov kernel on a measurable space (X, Af). Let us introduce the weighted- 
supremum Banach space {By, \\ ■ \\v) composed of measurable functions / : X— )-C such that 

ll/l|y:=sup^<oo 

XGX V{X) 

where 1/ : X — 7'[1, +00). Let (Bq, \\ ■ ||o) be the usual Banach space composed of all the bounded 
measurable functions / : X— >-C equipped with the supremum norm ||/||o := sup^.gx 1/(2^)1- 

The first purpose of the paper is to study the quasi-compactness of P on By with a control 
of its essential spectral radius ress(P). Recall that ress(P) is the infimum bound of the positive 
real numbers rg for which the following property holds: the spectral values of P of modulus 
greater than tq are finitely many eigenvalues having a finite-dimensional characteristic space. 
P is said to be quasi-compact on By if fess is strictly less than the spectral radius of P (see 
Section 2 for details). The second purpose of the paper is to specify the link between quasi- 
compactness and the so-called F-geometric ergodicity [MT93], namely with the convergence 
of P" to vr in operator norm on By, where vr denotes the P-invariant probability measure. In 
this case, we are interested in finding upper bounds for the convergence rate py{P) defined 

by 

pyiP) := mi {pe (0,1), sup ||P"/-7r(/)||y = 0(/,")}. (1) 

ll/llv<l 

Finally the third purpose of the paper is to derive the ^-geometric ergodicity of P, with a 
control of py{P), from the strong ergodicity property with respect to some Lipschitz- weighted 
spaces. 

Note that this paper is not directly concerned with the essential spectral radius or the 
convergence rate of Markov chains either with respect to the Lebesgue space (vr) as studied 
for instance in [Wu04, Sec. 5] for general Markov kernels and in [AP07] for Hastings and 
Metropolis algorithms using operator methods (see [FHL12, Section 2] for an overview), or 
with respect to By with a bounded function V, that is for uniformly ergodic Markov chains 
as investigated for instance in [Wu04, Th. 3.10] and [Hen07, Cor. IV. 1]. Mention that the 
paper [Wu04] also deals with the essential spectral radius and the convergence rate of iterates 
of P acting on By. Actually Wu's article is the closest work to ours. Precise comparisons 
between our results and those of [Wu04] are included throughout the paper but the core is in 
Subsection 2.3. 

Let us give an account of the main results of the paper in regards to our objectives. Under 
irreducibility and aperiodicity assumptions, it is well-known that the ^-geometric ergodicity 
holds under the following drift condition: 

3^ G (0,1), 3M G (0,+oo), PV<gV + Mls, (D) 

where 5 C X satisfies the minorization condition 

Vx EX, G X, P{x, A) > ij{A) ls{x), (S) 
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for some positive measure v on (X,A') (see [MT93]). In Theorem 1 (Subsection 2.1), without 
assuming any irreducibihty or aperiodicity conditions, the quasi-compactness of P on By is 
proved under Conditions (D)-(S). This is an expected result, abeady obtained in [Hen06, 
Hen07]. We provide a simple and short proof of Theorem 1 which enables to well understand 
why the drift condition implies good spectral properties of P on By. Furthermore we obtain 
the following upper bound for rggsiP) which is more explicit than in [Hen06]: 

ressiP) < ^''^^^^^^ with T := max(0, M - iy{V)). (2) 

In Theorem 2 (Subsection 2.2), assuming that, for some £ > 1, is a compact operator 
from Bq to By , P is shown to be power-bounded and quasi-compact on By under the following 
weak drift condition 

3iVGN*, 3dG (0,+oo), 35g (0,1), P^V<6^V + dlx. (WD) 

Such a condition with = 1 has been introduced in [MT93, Lem. 15.2.8] as an alternative to 
the drift condition [MT93, (V4)] under suitable assumption on V. Under Condition (WD), 
let us define the real number 5y{P) G (0, 1) as the infimum of the real numbers 5 € [0, 1) such 
that we have (WD): 

6v{P) :=inf{5G [0,1) : 3iV G N*, 3d € (0,+oo), P^V < 6^ V + dlx} . (3) 
Then the upper bound obtained in Theorem 2 for r^ssiP) is more precise than (2), that is: 

ressiP) < 6y{P). 

The key idea to prove Theorem 2 is that Condition (WD) yields a Doeblin-Fortet inequality 
on the dual of By. This fact has been already used in [FHLll] to study regular perturbations 
of ^-geometrically ergodic Markov chains. Under assumptions based on sophisticated param- 
eters /3io(-) and /3r{-) for measure of non-compactness of P, Wu presented in [Wu04, Th. 3.11] 
a formula for ress{P) involving equivalent functions to V. The assumptions, the conclusion 
and the proof of Wu's result are different from ours, as explained in Subsection 2.3. The ques- 
tion to know if the equality ress(P) = Sy{P) holds true under the hypotheses of Theorem 2 
is open. However, by combining our Theorem 2 and Wu's result, we prove in Subsection 2.3 
that the answer to the previous question is positive in many situations. In particular we have 
ress{P) = ^viP) in all tlie examples of our paper. 

When the Markov kernel P has an invariant probability distribution, the connection be- 
tween the ^-geometric ergodicity and the quasi-compactness of P is recalled in Theorem 3 
(Subsection 2.4). Namely, P is ^-geometrically ergodic if and only if P is a power-bounded 
quasi-compact operator on By for which A = 1 is a simple eigenvalue and the unique eigen- 
value of modulus one. In this case, if V denotes the set of all the eigenvalues A of P such that 
ress{P) < 1^1 < 1) then the convergence rate py{P) is given by: 

py{P) = ressiP) if V = $ and py(P) = max{|A|, A G V} if V / 0. 

This result is valid for any quasi-compact operator, however we have not found such an explicit 
result in the literature on ^-geometric ergodicity. 
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Theorem 4 proved in Subsection 2.5 is of great interest to investigate the eigenvalues of 
moduhis one and the above set V in order to obtain the ^-geometric ergodicity of P and, 
more importantly, an upper bound for pv{P) from Theorem 3. Namely, under Condition 
(WD), for any A G C such that < |A| < 1 where 5 is given in (WD), and for any p G N*, 
we obtain with /3(A) := ln|A|/ln5: 

/ G i3y n Ker(P - A/)P 3cG(0,+oo), |/| < c (In y)^^^"^)/^ y/3(A)_ 

In particular, if A is an eigenvalue such that |A| = 1, then any associated eigen-function / is 
bounded on X. By contrast, if |A| is close to 5v{P)-, then |/| < cV^^-^'^ with /3(A) close to 1. 

In Section 3, applications of Theorems 2-4 to discrete Markov chains are presented. When 
X := N and lim„ ^(n) = +oo, any Markov kernel P is compact from Bq to Bv^ and The- 
orem 2-Theorem 3 are then specially relevant: if P satisfies Condition (WD), then P is 
power-bounded and quasi-compact on By] if in addition P is irreducible and aperiodic, then 
P is ^-geometrically ergodic. In Subsection 3.4, Property (4) is used to compute the conver- 
gence rate pv{P) for birth-and-death Markov chains. 

Section 4 is devoted to ^/-geometrical ergodicity of iterated function systems (IPS). The 
ideas developed in this section are based on Lipschitz contractive properties of P as in [Wu04, 
Sect. 7.2]. More precisely, in [Wu04, Sect. 7.2] the contractive properties are expressed in 
terms of Wassertein distance. Ours are expressed in terms of moment/contraction conditions, 
called {Co) (for some a G [l,-|-cx))), which are classical for IPS, see [Duf97, Ben98, DP99]. 
Under Conditions {Co) and our compactness assumption on P^ : Bq^Bv for some £ > 1, the 
same precise bounds on ress{P) and pv{P) as in [Wu04, Sect. 7.2] are obtained for IPSs in 
Corollary 6. The others statements of Section 4 show that, in certain cases. Condition (Ca) 
can be used directly to obtain further interesting rates of convergence of IPSs with explicit 
constants. 

To the best of our knowledge. Theorems 1, 2 and 4 of Section 2 are new. Moreover 
Theorem 2 and Wu's result [Wu04, Th. 3.11] are complementary since their combination 
provides the expected formula VessiP) = ^viP) under general assumptions. As in [Wu04], 
the bounds on pv{P) are derived from those on r^ssiP) by using Theorem 3, but here we 
take advantage of Theorem 4 to study the eigenvalues A of P such that 5v{P) < |A| < 1. 
This approach is original and often provides the exact value of pv{P)- Most of bounds on 
Pv{P) obtained in Section 3 are new. Of course this method can only be used for Markov 
kernels P such that P^ is compact from Bq to By for some I > 1. Classical instances of 
^/-geometrically ergodic Markov kernels concern the discrete state-space, the autoregressive 
models on X = R'^ with absolutely continuous noise with respect to the Lebesgue measure, 
and finally the MCMC algorithms. Our compactness assumption is fulfilled for the two first 
instances, see Section 3 and Subsection 4.4. Unfortunately it does not hold in general for 
non-discrete Markov kernels arising from Hastings and Metropolis algorithms. Concerning 
the last issue, we refer to the works [Bax05, Lun97, LT96, MT96, MT94, RT99] where the 
convergence rate pv{P) is investigated by probabilistic methods. The best rates are obtained 
in [LT96] under the stochastic monotonicity assumption for P which cannot be compared 
with our compactness assumption. 
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Throughout the paper, when the function V = Vc depends on some parameter c, we use 
the notation Be = By^ so that Be may stand for different sets from section to section. 

2 Quasi-compactness on By and F-geometric ergodicity 

Let (B, II • II) be a complex Banach space, and let L be a bounded linear operator on B with 
positive spectral radius r{L) := lim„ HL^H^/", where || • || also stands for the operator norm on 
B. For the sake of simplicity, we assume that r{L) := 1 (if not, replace L with r{L)^^L). The 
restriction of L to a L-invariant subspace H is denoted by L^j^, and / stands for the identity 
operator on B. 

The simplest definition of quasi-compactness is the following (compare the definition below 
with the reduction of matrices or compact operators). 

Definition 1 L is quasi- compact on B if there exist tq G (0, 1) and m G N*, Aj G C, pi G N* 
(i = 1, . . . ,m) such that: 

m 

B = e Ker(L - XJf^ H, (5a) 

i=l 

where the Aj 's are such that 

I Ail > ro and 1 < dimKer(L - XilY' < oo, (5b) 
and H is a closed L-invariant subspace such that 

sup ||L"/i|| = O(ro"). (5c) 

h£H, \\h\\<l 

Concerning the essential spectral radius of L, denoted by r^ssiL), here it is enough to have 
in mind that, if L is quasi-compact on B, then we have (see for instance [Hen93]) 

VessiL) = inf |ro G (0, 1) s.t. we have (5a) (5b) (5c)}. 

It is also well-known (e.g. see [Nev64, Kre85]) that r^ssiL) is defined by 

r,,,(L) :=lim(inf||L"-K||)" (6) 

where the infimum is taken over the ideal of compact operators K on B. Consequently L is 
quasi-compact if and only if there exist some G N* and some compact operator Kq on B 
such that riJJ^^ — Kq) < 1. Under the previous condition we have 

re..(L)<(r(L"«-ifo))'/"». (7) 

Indeed, for all /c > 1 we have ||(L"o - Ko)''\\'^^^'"'''^ = ||L"«'' - Kfcf/("ofc) ^ith some compact 
operator Kk on B. Then (6) gives: ress{L) < lim^ ||(L"o - Ko)''\\'^^^''°''^ = {r{L''° - 7^0))^/"°. 
Finally, for any £ > 1, since lim„(inf ||L" - K\\)'^/'^ = limfc(inf ||L^'^ - A"||)V{^fc), 

we obtain 

VessiL) = {ress{L')f". (8) 

Throughout the paper, we consider a function V : X— 7'[1, -|-oo) and a Markov kernel P on 
(X, Af) such that PV/V is bounded on X (i.e. ||Py||v/ < oo). So P continuously acts on By- 
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2.1 Quasi-compactness on By under the drift condition 



Theorem 1 Let us assume that the Conditions (D)-(S) in Introduction hold true. Then P 
is a power-bounded quasi-compact operator on By with 

s{P) < ^''/"^^^^^ with T := max(0, M - iy{V)). (9) 



The proof of Theorem 1 is based on the next lemma. 

Lemma 1 Let Q{x,dy) be a Markov kernel on (X, A:") having a continuous action on By 
(i.e. \\QV\\v < oo) such that Q = A -\- B for some nonnegative bounded linear operators A 
and B on By. Let r{B) denote the spectral radius of B which is assumed to be positive. 
Then, there exists a nontrivial nonnegative continuous linear form rj on By such that rjoB = 
r{B) r? and r?(Alx) = (1 - r(S))r/(lx). 

Proof. Since i? > and r := r{B) > 0, we know from [Sch71, App., Cor. 2. 6] that there 
exists a nontrivial nonnegative continuous linear form r] on By such that rj o B = rrj (see 
also Remark 2 and Appendix A). From Q = A -\- B, we have rj o Q = rj o A -\- r rj, thus 
r]{Qlx) = r/(lx) = r/(^lx) + rr/(lx). Hence r]{Alx) = (1 - r)r]{lx). □ 

Proof of Theorem 1. Condition (D) implies that PV < gV -\- M 1^. Iterating this inequality 
easily ensures that sup^ HP'^yH^/ < oo, that is P is power-bounded. Then, from Plx = Ix 
and Ix S By, we have r{P) = 1. Moreover, since < oo, we deduce from (S) that 

^{V) < oo. Thus we can define the following rank-one operator on By. Tf := I5. Let 
R := P -T. From T > and from (S), it follows that < R < P, so r{R) < 1. Let us set 
r := r{R). If r = 0, then P is quasi-compact with ress(P) = from (7). Now assume that 
r G (0, 1]. Then, from Lemma 1, there exists a nontrivial nonnegative continuous linear form 
r/ on By such that r] o R = r rj and r]{Tlx) = (1 — r)r/(lx), from which we deduce that 

(l-r)7?(lx) ^ jl-rUV) 

'^''^ = Kix) - Kix) ■ 

Next, we have RV = PV -TV = PV-v{V)ls < gV + Mls-iyiV)ls = gV+{M-iy{V)) I5. 
Hence, setting r := max(0, M — I'iV)) > 0, 

rr]{V) = r]{RV) < gr]{V) + r 77(15) < gr]{V) + ; ' . (10) 



Since 77 7^ 0, we have rjiy) > 0, and since g G (0, 1), we cannot have r = 1. Thus r S (0, 1), 
and P is quasi-compact from (7) with r^ssiP) ^ fessiR) = f- Then Inequality (9) is deduced 
from (10). □ 

Remark 1 // Conditions (D)-(S) are fulfilled for some iterate P^ in place of P (with pa- 
rameters gN < 1, Mj\f > and positive measure vn[-)), then the conclusions of Theorem 1 
hold true with (9) replaced by 

VessiP) = re.s(P^)^/^ < ( M^^^^ll^^) ^ y^hcrc Tj, := max(0, M^v - ^^(F)). 



7 



Remark 2 The proof of Lemma 1 is based on the following result [Schll, App., Cor. 2. 6]: 
if L is a positive operator on a Banach lattice B whose positive cone is normal and has 
interior points, then there exists a nontrivial nonnegative continuous linear form e' on B such 
that e' o L = r{L)e' . In fact By is the simplest (and generic) example of Banach lattices 
satisfying the last conditions, and we give in Appendix A a proof of the previous statement in 
this special case. Mention that this result also provides that the quasi- compactness of P on 
By is equivalent to the mean ergodicity with finite rank limit projection (see [Her08], see also 
[Lin75, Lin78]). 

2.2 Quasi-compactness on By under the weak drift condition (WD) 

Recall that {Bq, \\ ■ ||o) denotes the Banach space of all the bounded measurable functions 
/ : X— >-C, equipped with the supremum norm ||/||o := sup^.^^ and that Sy{P) is the 

infimum of the real numbers 6 G [0, 1) such that we have (WD) (see (3)). 

Theorem 2 // Condition (WD) holds true and if : Bo^By is compact for some i > 1, 
then P is a power-bounded quasi-compact operator on By, and we have 

TessiP) < 6yiP). 

Proof. Iterating (WD) shows that P is power-bounded on By (proceed as in the beginning 
of the proof of Theorem 1). Since 6y{P) = {5y{P'^))^/'^ and ress{P) = (re,^(P^))^/^ (see (8)), 
we only consider the case i := 1, that is P : Bq By is compact. 

Now let {By, II • \\y) (resp. {Bq, \\ • ||o)) denote the dual space of By (resp. of Bq). Note 
that we make a slight abuse of notation in writing again || • ||y and || • ||o for the dual norms. 
Let P* denote the adjoint operator of P on By. In fact, we prove that P* is a quasi-compact 
operator on By with ress{P*) ^ ^viP): so that P satisfies the same properties on By. Since 
P : Bo^By is assumed to be compact, then so is P* : By ^ Bq. Moreover P* satisfies a 
Doeblin-Fortet inequality from Lemma 2 below. Then we deduce from Lemma 2 and [Hen93] 
that P* is a quasi-compact operator on By, with r^ssiP*) ^ ^ for any 6 £ {6y{P), 1), so that 
TessiP*) < 5y{P). □ 

For the sake of simplicity we consider the same usual bracket notation (•, •) in both B'y x By 
and B'q X Bq. Recall that By,BQ are Banach lattices, so are B'y, B'q. For each g' £ By 
(resp. g' G Bq), one can define the modulus l^'l of g' in By (resp. in B'q), see [Sch71]. For 
the next arguments, it is enough to have in mind that g' and \g'\ have the same norm in B'y 
(resp. in B'q), more precisely: 

yg'€B'Q, ||5'||o = (|5'|,lx) and Vg' e B'y, H^'Hv^ = (l^'l, y). 

Lemma 2 Let 6 G {6y{P), 1). Then, there exist G N* and d G (0, -|-oo) such that for all 
f G B'y we have: ||P*^/'||v^ < 6^\\f'\\y + d||/'||o. 

Proof. Let /' G B'y and n > 1. Since P" is a nonnegative operator on By, so is its adjoint 
operator P*" on B'y, and we have for all / G By such that ||/||y < 1 (ie. |/| < V): 

\{{PTf'j)\ < {{PTinifi) < {{PTinv) = (i/'i,p"y). 
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By definition of 6v{P) and from 6 G (5y(P), 1), tliere exist N £ N* and d G (0, +00) such 
that P^V <5^V + dlx. Thus 

\\{P*ff'\\v := sup |((P*)V,/)i 

/eBv,||/||v<i 

< iinp^'v) 

< 5^ (i/'i, y) + d (i/'i, ix) = s^'Wf'Wv + d||/'||o. 

□ 



2.3 Comparison with Wu's work and further statements 

Quasi-compactness of Markov kernels acting on By is fully studied in [Wu04, Th. 3.11]. A 
first difference between Theorem 2 and [Wu04, Th. 3.11] concerns their proofs. The proof of 
Theorem 2 is much more direct than in [Wu04] since it uses Doeblin-Fortet inequalities. The 
next remarks show that Wu's assumptions and conclusion are different from ours, but also 
complementary. 

• (X, d) is assumed to be a Polish space in [Wu04], and Wu's topological assumptions on 
P are the following ones: 

(Al') P{x,dy) and Pv{x,dy) := V{xy^V{y)P{x,dy) satisfy Hypothesis (Al)^ 

where Wu's hypothesis (Al) (introduced in [Wu04, p. 265]) uses sophisticated parame- 
ters /3«,(-) and /3r(') for measure of non-compactness of P. Our topological assumption, 
namely P^ (for some l> 1) is compact from Bq to Bv^ is more manageable and it only 
involves the kernel P (not Py)- Furthermore, using duality {[P*Y is compact from B'q 
to B'y)^ our compactness assumption corresponds to one of the standard hypotheses of 
[Hen93]. 

• The contraction-type condition in [Wu04, Th. 3.11] involves equivalent functions to V. 
When V{x)^+co as (i(x, xq) — >■ +00, it writes as follows (See {a.i) 44> {a.iii) in [Wu04, 
Th. 3.11]): there exists an equivalent function W (i.e. c~^V <W< cV) such that 

r{W) := hmsup ^^^/\ < 1. 
x^oo W{x) 

In practice, finding such a function W is not easy, excepted of course when we directly 
have PV < 6V + dlx with some 6 G (0, 1) and d > (in this case W = V). Our 
contraction- type condition is: 

6v{P) < 1. 

That all the iterates of P are involved in the definition (3) of 5v{P), and so in a bound 
of ress{P), is quite natural from the spectral definition of ress(P). Moreover, since the 
definition of Sv{P) is only based on the function V (not on equivalent functions), our 
contraction condition is more manageable than in [Wu04]. 



^The use of Py is crucial in [Wu04]; indeed Py is a bounded operator on Bo which has the same spectral 
properties as P acting on Bv- The statement [Wu04, Th. 3.11] is then deduced from the study of the 
essential spectral radius of bounded kernels acting on Bo, see [Wu04, Th. 3.10]. Note that when P satisfies 
Hypothesis (Al), the same holds for Pv whenever PV /V^ is bounded on X for some p £ (1, +cxd), see [Wu04]. 
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• Wu's conclusion [Wu04, (3.17)] states that rggsiP) is equal to the infimum of the 
quantities r(W) over all the equivalent functions W. Using this formula to compute 
rf,ss{P) seems to be very difficult in practice (anyway such computations are not re- 
ported in Wu's examples). Finally, combining Theorem 2 which gives the inequality 
ress{P) < ^v{P)^ ^-iid Wu's result enables us to prove in Corollary 1 that the expected 
equality ress{P) = ^v{P) holds in many case. 

Corollary 1 Assume that X ^s a Polish space, that P satisfies Condition (WD), that P^ is 
compact from Bq to By for some i > 1, that the topological assumptions (Al') of [Wu04, 
Th. 3.11] are satisfied, and finally that PV is hounded on each compact set of^. Then: 

ress{P) = 6viP). 

Proof. From Theorem 2 we know that r^ssiP) ^ ^viP)- Let r > ressiP)- From [Wu04, 
Th. 3.11] there exists an function W equivalent to V (i.e. c"^V < W < cV) such that 
PW < rW + dlx (since PW is bounded on compact sets). Iterating the last inequality 
shows that there exists e > such that: Vn > 1, P^W < r^W + e Ix- Thus we obtain 
Vn > 1, < cV"!/ + celx, so that we have for any p > r and for sufficiently large: 

P'^V < p'^V + celx- Therefore: Sv{P) < P- Since r is arbitrarily close to ress{P), so is p. 
This gives: 6v{P) < r^ssiP)- □ 

• In practice, (Al') is deduced from the following conditions (see [Wu04, p. 265]): 

(A2') P, Py are Feller and P^, Py are strongly Feller (for some £> 1). 

Recall that a nonnegative kernel T{-,dy) on X satisfying sup^-gx ^(^; ^) < oo is said 
to be Feller (respectively strongly Feller) if, for every bounded continuous (respectively 
measurable) function / : X— t-M, the function (T/)(-) := J f{y)T{-,dy) is continuous. In 
the general setting of Markov operators, it seems to be difficult to compare our compact- 
ness assumption with Hypothesis (Al'), and even with (A2'). However, for absolutely 
continuous kernels, Hypothesis (A2') is stronger than our compactness assumption as 
explained below. 

Conditions (K). (X, d) is a separable metric space equipped with its Borel cr -algebra X . Every 
closed ball of\ is compact. For some xq € H we have limd(a;,a;o)--i'+oo ^(^) = +oo. Finally 
there exist a positive measure t] on (X, A') and a measurable function K : X^ — 7'[0, -|-oo) such 
that: 

VxGX, P{x,dy) = K{x,y)dr]{y). (11) 

Lemma 3 Assume that Conditions (K) hold. If P^ is strongly Feller for some £ > 1, then 
P^^ is compact from Bq to By. 

Although Lemma 3 is a classical statement, we prove it in Appendix B for completeness. 
Obviuously we deduce from Lemma 3 that, if P™ satisfies Conditions (K) for some m > 1 
and if P^ is strongly Feller for some £ > 1, then P^fm -g QQjj^pact from Bq to By. This shows 
that the compactness assumption of Theorem 2 is fulfilled in all the examples of [Wu04, 
Sect. 8]. Furthermore, Lemma 3 and Theorem 2 allows us to derive the following result. 
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Corollary 2 Assume that Conditions (K) and (WD) hold and that is strongly Feller for 
some i > 1. Then P is a power-bounded quasi- compact operator on By with 

TessiP) < 6viP). 

Remark 3 If P is given by (11) with K continuous in the first variable, then P is strongly 
Feller. Indeed, for all {x,x') G X^, we have: 

\{Pf){x')-{Pf){x)\< [ \K{x',y)-K{x,y)\dri{y). 

Since we have K{-,-) > 0, J K{-,y)dr]{y) = 1, and lim^i ^ ^ K {x' , y) = K{x,y), we deduce 
from Scheffe's theorem that lim^i /x \K{x', y) — K{x, y)\ dr]{y) = 0. This proves the desired 
statement. The previous argument even shows that{Pf, ||/||o < 1} is equicontinuous. Observe 
that, when the last equicontinuity property holds and V{x) = +oo as xq) — ?• +oo, Ascoli's 
theorem is another way to prove the compactness of P : Bq ^ By ■ 

Let us introduce a last statement which gives a suitable sum up of the present discussion, 
in particular for analyzing ours models of Markov chains in Section 3. We know from [Wu04, 
p. 270] that if P is Feller and P^ is strongly Feller for some £ > 1, and if V and PV are 
continuous, then Py is Feller and Py^ is strongly Feller, and thus (Al') holds true (see 
(A2')). Therefore, using Corollary 1 and Lemma 3, we obtain the following statement. 

Corollary 3 Assume that X is a Polish space, that Conditions (K) and (WD) hold, that V 
and PV are continuous, that P is Feller and P^ is strongly Feller for some £> 1. Then P is 
a power-bounded quasi-compact operator on By with 

ress{P)=5y{P). 

2.4 Prom quasi-compactness on By to ^/-geometrical ergodicity 

Recall that a Markov chain {Xn)n&i with transition kernel P is ^-geometrically ergodic if P 
has an invariant probability measure vr such that 

(VGl) tt{V) < oo 

(VG2) hm sup ||P"/-7r(/)||y = 0. 
"^~/eev,||/||v<i 

Let n denotes the rank-one projection defined on By by: 11/ = 7r(/)lx- Note that the 
condition (VG2) is equivalent to the convergence to of — Il\\y, the operator norm 
associated with || • \\y. Moreover, using — LI = (P — 11)"', it can be shown that the 
convergence is geometric, that is, there exists p € (0, 1) and Cp G (0, -|-oo) such that 

||P"-n||y <Cpp". (12) 
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Recall that the infimum bound of the positive real numbers p such that (12) holds has been 
denoted by pv{P) and called the convergence rate of P on By. 

In this subsection we propose a result which makes explicit the relationship between the 
quasi-compactness of P and the IZ-geometric ergodicity of the Markov chain {Xn)n£n with 
transition kernel P. Moreover, we provide an explicit formula for pv{P) in terms of the 
spectral elements of P. A key element is the essential spectral radius ress{P)- For general 
quasi-compact Markov kernels on By^ the result [Wu04, Th. 4.6] provides interesting addi- 
tional material on peripheral eigen-elements. 

Theorem 3 Let P he a transition kernel which has an invariant probability measure vr such 
that vr(y) < oo. The two following assertions are equivalent: 

(a) P is V -geometrically ergodic. 

(h) P is a power-hounded quasi-compact operator on By, for which X = 1 is a simple eigen- 
value (i.e. Ker(P — I) = C • Ix) and the unique eigenvalue of modulus one. 

Under any of these conditions, we have py{P) > ressiP)- In fact, for tq G (re5s(P),l), 
denoting the set of all the eigenvalues \ of P such that rg < |A| < 1 by Vrg, we have: 

• either py{P) < when Vr^ = 0, 

• or py{P) = max{|A|, A G Vr^} when Vro / O- 

Moreover, if Vro = ^ f'^'^ ^0 G {ress{P), 1); then py{P) = ress{P)- 

From Definition 1, for any tq € {ressiP)-, 1)) the set of all the eigenvalues of A of P such that 
^'o < |A| < 1 is finite. 

Remark 4 The property that P admits a spectral gap on By in the recent paper [KM 11] 
corresponds here to the quasi-compactness of P (which is a classical terminology in spectral 
theory). The spectral gap in [KMllj corresponds to the value 1 — py(P). Then, [KMll, 
Prop. 1.1]) is a reformulation of the equivalence of properties (a) and (b) in Theorem 3 
under tp-irreducibility and aperiodicity assumptions (see also [KMll, Lem. 2.1]). The last 
statements in Theorem 3 provide the value of the convergence rate py{P) for V -geometrically 
ergodic Markov chains from the essential spectral radius ressiP) and the (possible) eigenvalues 
A such that r^ssiP) < |A| < 1. 

Proof Note that we have By = ClxB Hq, with Hq := {/ G By : ir{f) = 0} (write 
/ = vr(/)lx + (/ — ^(/)lx))- Since vr(y) < oo, vr defines a bounded linear form on By, so 
that Hq is a closed subspace of By. From the invariance of vr, we obtain that P{Hq) C Hq. 

Now assume that (a) is fulfilled. Then for any p G {py{P), 1) we have from (12): 

sup = 0(p"). 
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It follows from Definition 1 that P is quasi-compact on with r^ssiP) ^ PviP)- The fact 
that P is power-bounded on By easily follows from (a). 

Conversely, assume that (6) holds and prove that Property (a), together with the claimed 
properties on pv{P), are fulfilled. Since P is Markov and power-bounded on By, we have 
r{P) = 1. From Definition 1 and the assumption on the peripheral eigenvalues of P, we 
obtain for any tq G {ress{P), 1): 

= C Ix e ( ©Aev.„ Ker(P - XI)p^) H, (13) 

where H is a closed P-invariant subspace of By such that sup^gj^ ||/j||^<]^ HP^/iHy = 0(ro"). 
Let / G By- Then we have 

/-7r(/)lx= /a + /i, (14) 

AGVro 

with fx £ Ker(P — XI)^^ and h £ H, and there exist some constants cx and ch (independent 
of /) such that H/aHf < CA||/||y and \\h\\y < cnUfWy (since the projections associated with 
the decomposition (13) are continuous). 

When Vro = 0, then (14) yields 

IIP"/ - 7r(/)lx||y = ||P"(/ - ^(/)lx) ||y < 0(ro") ||/||y. 

Thus Property (a) holds and py{P) < tq. If Vro ~ ^ ^0 ^ (^ess(-P)) 1)) then py{P) < 

ress(P), so that py{P) = ressiP) from the proof of (a) =^ (6). 

When Vro '^j define v := max{|A|, A G Vro}- have for A G Vro P\ 
ll^'"/A||y = ll(^'-A/ + A/)"M|y < E ( Jl^r"'ll(^-^^)'llv'll/AllF 

^ 0|Ar'll(^-A/)'||y)||/||y. 

Then for each k = 0, . . . ,px-l, we have (fc)|A|"-*^ = 0(n'=|A|") < 0{n''v'^). Thus \\P'^fx\\v = 
0{p^) WfWv for any p G (z^, 1). From (14) and rg < /3, we obtain: 

IIP"/ - ^(/)lx||y < Yl II^"/a|| + ||P"/i||y < (0(p") + 0(r-o")) ||/||y = 0(p") ||/||y. 
AeVro 

Since p G (z^, 1) is arbitrary, this gives py{P) < Conversely, given any A G Vro / ^ 
such that P/ = A/, we have 7r(/) = (use the invariance of tt), and from P"/ = A"/ and 
the definition of py{P), we easily deduce that |A| < py{P). Thus z^ < py{P). □ 

The next lemma provides the existence of the P-invariant probability measure under the 
weak drift Condition (WD). This statement (see e.g. [MT93] for similar results) will be of 
interest in our examples for the use of Theorem 3. For completeness, a proof is given in 
Appendix B. 

Lemma 4 Assume that (X, d) is a separable complete metric space and that V : X — 7'[1, -|-oo) 
is continuous and such that the set {V < a} is compact for every a G (0, -|-oo). Under 
Condition (WD), there exists a P-invariant probability measure vr such that tt{V) < oo. 
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2.5 Study of characteristic functions 



As explained in Introduction, the next theorem plays an important role in our work. 

Theorem 4 Assume that the weak drift condition (WD) holds true. If X £ C is such that 
S < \X\ < 1, with 6 given in (WD), and if f £ By D Ker(P - \I)P for some p G N*, then 
there exists c G (0, +oo) such that 

In lAI p(p—l) 

I/I < cV^ (InV)^^. 

The proof of Theorem 4 is based on the following lemma. 

Lemma 5 Let A G C 6e such that S < \X\ < 1. Then 

V/ G Bv, 3c G (0,+oo), Vx G X, |Ar"(^) |(P"(^V)(2^)| < cV{x)'-^ (15) 
with, for any x G X, n(x) := |_— ^^^^J , where [-J denotes the integer part function. 

Proof. First note that the iteration of (WD) gives 

k-i , 
Vfc>l, pkNy <5''^V + d{y5^^)H<5''''V + —^lx. (16) 

j=0 

Let g G By and x G X. Using (16), the positivity of P and \g\ < \\g\\v y, we obtain with 
b:=d/{l-5^): 

Vfc> 1, |(P'^^5)(x)| < (P'^'mx) < \\g\\v {P'''V){x) < \\g\\v{d'''V{x) + b). (17) 

The previous inequality is also fulfilled with A; = 0. Next, let / G By and n G N. Writing 
n = kN + r, with /c G N and r G {0, 1, . . . , — 1}, and applying (17) to g := P*"/, we obtain 
with e := maxo<,<,v-i \\P'f\\v (use P"/ = P^^ {P' f)): 

|(P"/)(2;)| <e[<5^'^^(2;) + 6] <C[<5""(5""^^(x) + 6)] <i5~^ {5''V{x) + h). (18) 

Using the inequality 

Inl^(x) ^ . . . . Inl^(x) 
and the fact that In 5 < In |A| < 0, Inequality (18) with n := n(x) gives: 



This gives the desired conclusion with c = ^5 ^(e^'^l'*'! ^^^^ + h). □ 
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< i [{5\\\~^Y^^^ V{x) + 6 1 A|-"(^) j 

= ^ <5-^ (e^ '°^(-) '^1-1°^ + b F(x)^) 
= e {e^'' 1^1"'"^^ + 6) y (x)^ . 



Proof of Theorem 4. U f e By D Ker(P - XI), then |A|-''(^)|(P"(^)/)(x)| = so that 

(15) gives the expected conclusion when p = 1. Next, let us proceed by induction. Assume 
that the conclusion of Theorem 4 holds for some p > I. Let / G By H Ker(P — XI)'^^^. We 
can write 

min(n,p) , , 

P'^f = {P-XI + XlYf = X'' f + ( JA"-'=(P- A/)V- (19) 

For A: G {1, . . . we have fk := {P - XI f f G Ker(P - XIY+^-'' C Ker(P - A/)^, thus we 
have from the induction hypothesis : 

3c'g (0,+oo), VA;G{l,...,p}, VxGX, |/fc(x)| < c'y(x)^ (Iny(x))^^. (20) 
Now, we obtain from (19) (with n := n{x)), (20) and Lemma 5 that for all 2; G X: 

min(7i,p) 

\f{x)\ < |A|-"(^-)|(P'^(^V)(2;)| +c'y(x)^ (Iny(x))'^ lAr"^^'^^"'^) Yl 

k=l 

In |A| In |A p(p-l) 

InlAI P(p-l) I 

< C2V{x)^ iliiVix))^^+P 

with some constants ci,C2 G (0, +00) independent of x. Since p{p — l)/2 + p = p{p + l)/2, 
this gives the expected result. □ 

To conclude this section, notice that the ^-geometrical ergodicity clearly implies Condi- 
tion (WD). However Condition (WD) is not sufficient for P to be ^/-geometrically ergodic, 
even if P is assumed to be compact from Bq from By- In fact, the previous statements provide 
the following procedure to check the ^-geometric ergodicity of P and to compute an upper 
bound for its convergence rate pv{P)- Let P be a transition kernel with an invariant probabil- 
ity measure vr such that 7r(y) < 00. Theorem 2 shows that, if P (or some iterate) is compact 
from Bq into By and satisfies the weak drift condition (WD), then P is quasi-compact on 
By and r^ssiP) < ^viP)- Next Theorem 3 ensures that the ^-geometric ergodicity of P can 
be deduced from quasi-compactness provided that the following properties are satisfied : 

(i) A = 1 is a simple eigenvalue of P on By, namely Ker(P — /) = C • Ix; 

(ii) A = 1 is the unique eigenvalue of P of modulus one on By. 

Finally Theorem 4 can be useful to check (i)-(ii), and in a more general way to investigate 
the sets Vro of eigenvalues of P given in Theorem 3 in order to obtain an upper bound for 
the convergence rate py(P). This procedure is applied in the next section. 

3 Applications to discrete Markov chains 

In this section, we are concerned with discrete Markov chains. For the sake of simplicity, we 
assume that X := N throughout the section. Let P be a Markov kernel on N. The main focus 
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is on the estimation of the essential spectral radius ressiP) from Condition (WD): a general 
statement is derived from Corollary 2 in Subsection 3.1, and applications to random walks 
(RW) with bounded state-dependent increments are presented in Subsection 3.2. 

For irreducible and aperiodic discrete Markov chains, criteria for the ^-geometrical er- 
godicity are well-known from the literature using, either the equivalence between geometric 
ergodicity and ^-geometric ergodicity of N- valued Markov chains [HS92, Prop. 2.4], or the 
strong drift Condition (D) with a small set S [MT93]. In Subsection 3.3, we just explain as 
an alternative way how the quasi-compactness combined with irreducibility and aperiodic- 
ity conditions provide the ^-geometrical ergodicity. Finally the procedure mentioned at the 
end of the previous section (see (i))-(ii)) is applied to compute the convergence rate of some 
random walks (see Example 4 and Subsection 3.4). Such computations are not reported in 
Wu's work, excepted for the examples [Wu04, Ex. 8.3-8.4] corresponding to the specific case 
P(0, 0) := 1 — g in Subsection 3.4. Wu obtained the rate pviP) for these two examples as 
special instances of discrete reflected random walks. These processes, also called Lindley's 
random walks, are investigated in Subsection 5.2, in which we obtain the rate of convergence 
with explicit constant for general discrete Lindley's random walks. 



3.1 Quasi-compactness of discrete Markov chains 

Let P = {P{i,j))ij£f^'2 be a Markov kernel on N. The function V : N — )-[l, -|-oo) is assumed 
to satisfy 

(PV)(n) 

lim V{n) = +00 and sup — — < oo. 

Corollary 4 The two following conditions are equivalent: 

(a) Condition (WD) holds with V; 

(b) L := mf^(^^)^ < 1 where In := limsup^^+^{P^V){n)/V{n). 

In this case, P is power-bounded and quasi-compact on By with 

ress{P) = 6v{P) = L. 

Proof. That P is power-bounded and quasi-compact on By with r^ssiP) = Sv{P) under 
(WD) follows from Corollary 3 since P is strongly Feller in the discrete state space case. 

Let us prove the equivalence (a)4^(b), as well as the equality Sv{P) = L. First, Condi- 
tion (WD) clearly gives ^a? < 5^ < 1 (with 5 in (WD)), thus L < 5viP) by definition of 
6v{P)- Conversely, assume that L < 1: there exists N > 1 such that ipf < 1. Let 5 be such 
that iN < < 1. Then there exists no S N such that: Vn > no, {P^V){n)/V{n) < . 
Hence 

P^V<6''V + d, withd:= max (^"^^^^'^ 



0<i<no V{i) 
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This proves (WD), and 6v{P) < {InY/^ since 6 is arbitrary close to {£n)^^^ ■ In fact, the 
last argument shows that dv{P) < {(-nY^^ provided that 1^ <1. From definition of L, there 
exists a sequence {Nk)k>o such that L = \mik{£i^^_Y^^'' . Thus we have 5v{P) < {^N^Y^^'' for 
k large enough. Thus 6y{P) < L. □ 

In the next subsections. Corollary 4 is applied to random walks on N with the following 
special sequence := (7")neN for some 7 G (l,+oo). The associated weighted-supremum 
space B-y = By^ is defined by: 

^7 ■■= {(/W)nGM G : sup7-"|/(n)| < 00}. (21) 

neN 



3.2 Quasi-compactness of RW with bounded state-dependent increments 

Let us fix 6 E N*, and assume that the kernel P on X := N satisfies the following conditions: 

VzG {0,...,6-l}, ^P(i,i) = l; 

i>o 

Vi>6,ViGN, P{t,j) = r ^.^ (22) 

[aj_i(z) if |i - j| < 

where (a_b(i), . . . , ab(i)) G [0,1]^^"^^ satisfies X^^;=_bafe(*) = 1 for all i > b. This kind of 
kernels arises, for instance, from time-discretization of Markovian queueing models (see a 
basic example in Remark 6). 

Proposition 1 Assume that for every k £ such that < 6 

lim afc(n) = G [0,1], (23a) 

n 

and that 7 G (1, +00) is such that 

b 

(/)(7):= Y.akj'<l (23b) 
k=-b 

VzG{0,...,6-l}, J^P(i,i)7^' <(X). (23c) 
i>o 

Then P is power-bounded and quasi- compact on By with 

TessiP) < Hi). 

Proof. Set (jinil) '■= Yl\=-h^k{n) 1^ ■ We have (PKy)(n) = (f)n{'^)V-y{n) for each n>b, hence 
we obtain from (23a) 

hmsup ' < (pi-y). 
The conclusion of Proposition 1 then follows from Corollary 4 using (23b)-(23c). □ 
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Assume that oq 7^ 1- Let (p^^^ be the k-th derivative of (p. It is checked in Appendix C.2 
that there exists 1 < £ <2b such that 

yk e {1,... ,£-!}, (/.W(1) = and </,W(l)/0, (24) 

according that the first condition is removed when i = 1. Since = 1, a sufficient condition 
for (23b) to hold for some 7 S (1, +00) is that 

(P^^\l)<0. (25) 

Example 1 (State-dependent birth-and-death Markov chains) When b := 1 in (22), 
we obtain the standard class of state- dependent birth-and-death Markov chains. Namely, the 
stochastic kernel P is defined by: 

yn> 1, P{n,n - 1) := pn, P{n,n) := rn, P(n, n + 1) := g„, 

where the real numbers {pn, r^, Qn) £ [0, 1]^ and Pn i~n + Qn = 1- Assume that the following 
limits exist: 

limpn := p£ (0, 1], hmr„ := r G [0, 1), hmg„ := q. 
n n n 

If J G (l,+oo) is such that 

4>{l) ■= - + r -\- q-y < I and ^ P(0, n)7" < 00, 

then it follows from Proposition 1 that P is power-bounded and quasi-compact on with 

p 

' ess 

(P) < - + r + 57. 

7 

The conditions 7 > 1 andp/^+r+q^ < 1 are equivalent to the following ones (use r = 1—p—q 
for ii)): 

(1) either p > q > and 1 < 7 < p/q; (ii) or q = and 7 > 1. 



• When p > q > and 1 < 7 < p/q: ^/ X]„,>o -f(O) ^)7"' < co, then P is power-bounded 
and quasi-compact on with ress{P) ^ (f'il)- Set 7 := sjpjq. Then 

min(/)(7) = (/>(7) = r + 2VM e (?^^ !)• (26) 

7>1 

Consequently, 2/X]„>o -P(0, f^)(7)" < 00, then the previous conclusions holds for j := 7, 
with essential spectral radius on B^ satisfying 

ress{P) < r + 2VM- 

• When q := and 7 > 1; if '^n>QPiO,n')^"' < 00, then P is power-bounded and quasi- 
compact on B^ with 

ress{P) < (Pil) = Ph + 

Such a case is illustrated by the next example. 
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Example 2 (Simulation of a Poisson distribution with parameter one) The Markov 
kernel P on X := N defined by 

P(0,0)=P(0,1) = ^ 

In 1 
Vn>l, P(n,n-1):=-, Pin,n) := nn,n + l):=^^. 

arises from a Hastings-Metropolis sampler of a Poisson distribution. We have p = r = 1/2 
and q = with the notations of Example 1. Hence, for each 7 G (1, +00), P is power-bounded 
and quasi- compact on B-y and 

ressiP) < 1/2 + 1/(27). 



Remark 5 (Random walks with i.d. bounded increments) Consider the case when the 
increments 0^(71) do not depend on the state n, that is when the kernel P is 

ViG{0,...,6-l}, ^P(i,j) = l; V^>6,ViGN, P{t,j) = 

where (a„;,, . . . , a;,) G [0,1]^^^^^ and 'Y^k=~b^k = 1- Obviously the statements of Example 1 
apply but some additional facts can be deduced for such Markov chains. First note that 

V7 G (1, +00), ViV > 1, Vn > Nb, {P^Vy){n) = ()){-f)^ V^{n). (27) 

Consequently, under the assumptions (23c) and ^(7) < 1 where (f)[-) is given by (23b), we 
obtain from Corollary 4 that Condition (WD) is fulfilled with and 

r,ss{P) = Sv,{P) = m- (28) 

Moreover, it is shown in Appendix C.2.2 that, under the assumptions 7^ 1 and (23c), 
Condition (WD) holds true with V.yg for some 70 G (1,7] if and only if (f>^^\0) < 1 (see (24)- 
(25)). Finally, for the birth- and- death Markov chains, that is when b := 1, the convergence 
rate can be computed (see Subsection 3.4)- 



( Oj^i if \i - j\ <b 
1 if \i-j\>b 



3.3 ^-geometrical ergodicity for discrete Markov chains 

Let P = {P{i, j))ij^fq2 be a Markov kernel on N. The following irreducibility and aperiodicity 
conditions for discrete Markov chains are well-known. For any {i,j) G N^, define 

7^iJ:={n>l:P"(i,j)>0}. 

The Markov kernel P is said to be irreducible if 

V(i,i)GN, TZij^d}, {!) 

and to be aperiodic if 

3 i G N, 7^i,i - 7^i,i := {n - m, (m, n) G 7^^,^ x TZi^i} = Z. {A) 
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Since TZi^i is stable under addition from the Chapman-Kolmogorov equation, the subgroup 
of Z generated by TZi^i coincides with TZi^i — TZi^i. Hence the aperiodicity Condition {A) is 
equivalent to the usual one: the largest element d = d{i) E N* such that TZi^i C (i-N* (i.e. the 
g.c.d. of TZi^i), called the period of i, is equal to 1. If P is irreducible then each state j G N 
has the same period. 

Corollary 5 Under Condition (WD), P has an invariant probability measure vr such that 
Tr{V) < oo. If the additional Conditions (I)-{A) hold true, then P is V -geometrically ergodic. 

The previous statement is well-known. It can be derived from quasi-compactness (note that 
the first assertion follows from Lemma 4): apply Corollary 4 and Theorem 3 with Condi- 
tions (I)- (A) (see Appendix C.l for completeness). 

Under the additional Conditions (I)- (A), all the statements of Subsection 3.2 can be 
completed in order to find again the V^-geometric ergodicity. For instance, in Example 1, 
the V^-geometrical ergodicity holds when P(0,0) G (0,1) and Pn,gn > for all n > 1.^ 
In Example 2, Conditions (I)- (A) are automatically fulfilled so that P is Ky-geometrically 
ergodic without additional assumptions. 

Note that the irreducibility condition is not necessary for P to be ^/-geometrically ergodic: 
in this case the use of Theorem 3 (via Corollary 4) is of interest to obtain the ^-geometric 
ergodicity as illustrated in the following simple example. 

Example 3 (An instance of binary RW) Assume that 

Vn > 1, P{n, n — 1) := p £ (0, 1], P{n, n) := r = 1 — p. 

Under the assumptions 7 > 1 and X]„>o ^(0) "')7" < 00, we know from (28) that r^ssiP) = 
p/j + r. Note that Condition (X) is not automatically fulfilled in this instance. Anyway, 
without additional assumptions, P is V^/- geometrically ergodic. Indeed Theorem 3 applies. 
First the equation Pf = f leads to: Vn > l,/(n) = f{n — 1), so that f is constant. Hence 1 
is a simple eigenvalue of P. Second, given A G C, |A| = 1, A 7^ 1, any solution of Pf = Xf 
is of the form: f = {f{0)zx^)n^fq with z\ := p/{\ — r). From |A — r| > 1 — r = p, we obtain 
\zx\ < 1, so that the equality (P/)(0) = A/(0), namely A/(0) = /(O) X]„>o -f(0, f^)zA" is only 
possible when /(O) = 0. Hence 1 is the only eigenvalue of modulus one. 

Finally recall that, as it was outlined at the end of Section 2, quasi-compactness is espe- 
cially of interest for bounding the convergence rate of P. Example 4 below is a first simple 
illustration of this fact. Other applications to birth-and-death Markov chains are proposed in 
the next Subsection. 

Example 4 (An instance of RW with unbounded increments) Let us point out that 
Corollary 4 o-nd Theorem 3 may be also useful for random walks on X := N with unbounded 

^Note that, if -P(0, 0) = 0, then the period d(0) of i may be equal to 2. For instance this fact holds when 
P(0, 1) = 1, and A = —1 is then an eigenvalue of P: P is quasi-compact on B=j, but it is not V^^-geometrically 
ergodic. 
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increments. For instance, let P be defined by [MS95] 

Vn > 1, P(0, n) := q„,, Vn > 1, P(n, 0) := p, -P(n, n + 1) := q' = 1 — p, 

with p G (0, 1) and qn G [0, 1] such that Yln>i In = ^- For 7 G (1, +00) and Vy := (7'^)„gN, 
we have: Vn > 1, (PKy)(n) = p + q^"~^^ = (p/7" + (?7)V^(n). Thus, if ^ £ {1,1/q) and 
Yln>i^nn^ < 00, then Condition (WD) holds with and we have 5y^{P) < (77. Therefore, 
under the previous conditions, if follows from Corollary 4 that P is power-bounded, quasi- 
compact on B^i and 

ress{P) < qi- 



No additional assumptions are required to obtain the V^-geometric ergodicity: P is V^- 
geometrically ergodic provided that 7 G (1,1/g) and X^„>i 9ra7" < oo- Moreover the conver- 
gence rate pv^ (P) of P on satisfies: 

PV,{P) <n'i&x{qj,p). (29) 

Proof of (29). Theorem 3 is applied with any tq > max((77,p). Let A G C 6e such that 
max((77,p) < |A| < 1, and let f G B-y, f ^ 0, be such that Pf = A/. We obtain f{n) = 
{X/q)f{n — 1) — pf{0)/q for any n>2, so that 

' X-qJ^X-q- 

Since f £ B-y and \\\/q > 7, we obtain /(I) = p/(0)/(A — q), and consequently: Vn > 
1, /(n) =p/(0)/(A - q). Next the equality A/(0) = (P/)(0) = En>i 9n/(n) gives: A/(0) = 
p/(0)/(A — q) since X],„>i 9n = 1- We have /(O) 7^ since we look for a solution / 7^ 0. 
Thus X satisfies X^ — qX — p = 0, namely: A = 1 or A = —p. The case X = —p has not to be 
considered from assumption. If X = 1, we have f{n) = /(O) for each n G N, so that 1 is a 
simple eigenvalue. We have proved that 1 is a simple eigenvalue of P on By and that A = 1 
is the only eigenvalue of P on By such that ma.x{q'y,p) < \X\ < 1. Then Theorem 3 gives the 
estimate (29) of the convergence rate. Note that p cannot be dropped in (29) since X = —p 
is an eigenvalue of P on B^ with corresponding eigenvector (up to a multiplicative constant) 

fp ■■= -P, •••)• 



3.4 Study of the convergence rate for the birth-and-death Markov chains 

We consider real numbers p,q,r G [0, 1] such that p + r + g = l, p>g>0, and we assume 
that P is defined on X := N by 

Vn > 1, P{n, n — 1) := p, P{n, n) := r £ [0, 1), P{n, n + 1) := q, 

P(0,0) G (0,1), ^P(0,n)(7)'^<oowhere7:=./^G(l,+oo). ^^^^ 
n>0 " ^ 
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Let := (7")nGN- The weighted-supremum space := By- associated to Vy is defined 
in (21). Note that Conditions (I)- (A) hold true. We know from Corollary 5 that P is 
1^-geometrically ergodic and from (26) (28) that 

ressiP) = 6v^{P) = r + 2^. 

As illustrated afterwards, thanks to Theorem 3 and Lemma 6 below, the last estimate on 
the essential spectral radius ressiP) is relevant to compute the convergence rate pv~{P)- 

Lemma 6 Assume that Conditions (30) hold true. If f is a nontrivial eigenvector in 
associated with a complex eigenvalue X of P such that r + l^fpq < |A| < 1 then 

3qi G C \ {0}, Vn > 0, f(n) = ax za", (31) 

with z\ satisfying the following conditions: 

\zx\<i, (32a) 
qzx^ + {r-X)zx+p = 0, (32b) 
^P(0,n)zA" = A. (32c) 

n>0 

Proof Let A G C be such that r + 2^ < |A| < 1. Let / G / / such that Pf = Xf so 
that 

Vn>l, Xf{n)=pf{n-l)+rf{n) + qf{n + l). (33) 
Let us denote by zx , z'-)^ the two complex solutions of the characteristic equation 

qz'^ + (r — X)z + p = 0. 

Observe that zxz'^ = p/q = 7^. Recall that the solutions of (33) are of the form, either 
f{n) = qiZa" + Oi2z'y^"' if Zx / 2;^' /(^) ~ oii^x^ + a2'nzx"' if zx = z'-^^ with ai, 02 G C. 

We have \z-x\ 7^ Indeed, Theorem 4 applied with p := 1 and 5 := r + '2^/pq implies 

that I/I < cVy'^ with r := ln|A|/ln5 G (0,1) and some constant c. Consequently we have 
|cki-2a" + 02-2a"| ^ c^'^^ in case zx 7^ z'^, and IoiZa*^ + Oi2nzx"'\ < c-y"^" in case zx = Zy If 
\zx\ = \z'^\, then we would have l^^l = 1-2^1 =7) but the two previous inequalities then easily 
imply that ai = 02 = 0, that is / = 0. 

From |za| 7^ \^'x\j suppose that (for instance) \zx\ < 7 and \z'^\ > 7. Since /, 

{zx^)neN ai'e in Btf and {z'^^)n is not in B^, we obtain: Vn > 0, f{n) = aiZx^- Since / 7^ 
(i.e. ai 7^ 0), the equation (P/)(0) = A/(0) implies that zx must satisfy (32c). □ 

Proposition 2 In addition to Conditions (30), the boundary transition probabilities are as- 
sumed to satisfy, for some a G (0, 1).' 

P(0,0):=a, P(0, l):=l-a. 

Then P is Vy- geometrically ergodic. Furthermore, defining oq := ^ — q — ^/pq, the convergence 
rate pv~{P) of P is given by: 
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when a £ [gq, 1); 



when a G (0, ao] .' 



(a) in case 2p < (l — q + y/pq) ^ • 

py^(P) =r + 2VM; 

(h) in case 2p > (l — g + y/pqf' , setting ai := p — ^Jpq — \Jr{r -\- 2y/pq) : 

p{l - a) 



pv^iP) = a + 

^ a-l+q 

PvAP) = r + 



when a G (0, ai] 
when a G [oi, ao). 



(34) 



(35) 



(36a) 
(36b) 



When r := in the previous proposition, we have = ai = p — y/pq = (p — (?)/(! + y^/p), 
and it can be easily checked that 2p > {1 — q + ^Jpqf' ■ The properties (34) (36a) (36b) then 
rewrite as: 



pq ~\~ {a — ^)^ 
py^ (P) = \a-p\ " ^ 

pvAP) = 2y/pq when a G (oq, 1). 



(37a) 
(37b) 



Using Kendall's theorem, the properties (37a)-(37b) have been proved for a < p in [RT99] 
and [Bax05, Ex. 8.4]. For a > p, (37b) can be derived from [LT96] using the fact that P is 
stochastically monotone. Our method gives a unified and simple proof of (37a)-(37b), and 
encompasses the case r ^ 0. 

Proof of Proposition 2. By elimination, given some A G C, a necessary and sufficient condition 
for the two following equations: 



qz^ + (r — X)z + p = 0, 
a + (1 — a)z = A. 

to have a common solution z G C is that 



(38a) 
(38b) 





1 — a a 


-A 






1 


-a I 





= 


1 


— a a — A 


= (1- 


A) 




1 


a — A 




q r 


-A p 








q 1 


P 








= (1- 


A) [(A 


-a)(l- 


a-q 



(39) 



Assume that a ^ 1 — q. Then A = 1 is a solution of (39) and the other solution of (39), 
say A(a), and the associated complex number in (38b), say z{a), are given by: 



A(a) := a + °^ G M and z{a) := ^— G 

^ ' a-l+q ^ ' a+q-l 



(40) 
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Now, let A € C be such that r + 2yjpq < |A| < 1, and assume that there exists / G B^y, 
/ 7^ 0, such that Pf = A/. Then Lemma 6 gives / := {zx")n>o (up to a multipUcative 
constant), with zx £ C satisfying \zx\ < 7 and Equations (38a)- (38b). Thus we have A = A(a) 
and zx = z{a), with A(a) and z(a) given by (40). Conversely, we have Pfa = A(a)/a with 
fa = (•z(fl)")n>o since, by definition, z{a) satisfies the equations (38a)-(38b) associated with 
A = A(a). Now we must find the values a € (0, 1) for which we have r + 2yjpq < |A(a)| < 1 
and \z{a)\ < 7. This is the relevant question since Theorem 3 gives the following properties: 

(i) if r + 2y/pq < |A(a)| < 1 and \z{a)\ < 7, then we have pv~{P) = |A(a)| since A(a) is the 
only eigenvalue A of P on Biy such that r + 2^Jpq < |A| < 1 (apply Theorem 3 with any 
ro such that r + 2^ppq < tq < |A(a)|), 

(ii) if A(a) or z{a) do not satisfy the previous conditions, then we have pv~{P) = r + 2y/pq 
since there is no eigenvalue A of P on Bj such that r + 2.ypq < |A| < 1 (apply Theorem 3 
with any tq such that r + 2y/pq < tq < 1). 

First, observe that 

\z{a)\<j ^ \a-l + q\>^. (41) 
Hence, if a G (ao, 1) (recall that qq := 1 — q — ^/pq), then \z{a)\ > 7. Then (ii) gives (34). 

Second consider the case a G (0, oq]. Then we have \z{a)\ < 7, and we have to study A(a). 
Note that A'(a) = 1 —pq/ (a — 1 + g)^, so that the function a 1— )• A(a) is increasing on (—00, cq] 
from —00 to A(ao) = r — 2yjpq. Thus 

VaG(0,ao], \{a) < r - 2yfpq < r ^ 2^fpq. 

and the equation A(a) = — (r + 2^/pq) has a unique solution a\ G (— oo,ao). For the 
continuation, it suffices to have in mind that oi < oq and A(ai) = — (r + 2y/pq), that 
A(0) = p/{q - 1) G [-1, 0) and finally that 

A(0) - A(ai) = p/{q - 1) + r + 2VM = ' - I? - 2p 

1 - q 

When 2p < {I — q + -^/pq)"^, (35) follows from (ii). Indeed |A(a)| < r + 2^fpq since 

Va G (0, ao], -(r + 2^) = A(ai) < A(0) < A(o) < r + 2^. 
When 2p > {I — q + ^/pq)'^, we have ai G (0, oq] and: 

• if a G (0,ai), then (36a) follows from (i). Indeed r + 2yjpq < |A(a)| < 1 since 

Va G (0, ai], -1 < A(0) < A(a) < A(ai) = -(r + 2^) ; 

• if a G [ai,ao], then (36b) follows (ii). Indeed |A(a)| < r + 2^Jpq since 

-(r + 2Vm) = A(ai) < A(a) < r + 2^. 
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It remains to study the special case a = 1 — q. Then A = 1 is the only solution of (39). 
Again let A G C be such that r + 2^ypq < |A| < 1, and let / 6 B^, / / 0, such that 
Pf = A/. Then Lemma 6 gives / := {zx^'')n>o, with zx G C satisfying Equations (38a)-(38b), 
thus Equation (39). Consequently there is no eigenvalue of P such that r + < |A| < 1. 

Theorem 3 applied with any ro G (r + 2^ypg, 1) then gives pv^{P) = r + □ 

Remark 6 Let us consider the time-discretised M/M/1 queue obtained using the uniformiza- 
tion technique [HS92, Section The arrival and service rates are denoted by (3 > and 

/i > respectively. For < h < l/(/3 + n), the kernel Pfi is defined by P^ = I + HQ where 
Q is the generator of the continuous time birth- and- death process {Xt)t>o of the number of 
customers in a M/M/1 queue, so that 

Ph{0,0):=l-f3h, Ph{0,l) := ph; 
Vn > 1, Ph{n,n - 1) = fih, P/,(n, n) := 1 - /i(/3 + /i), Ph{n,n + 1) = ^h. 

Assume that /3//x < 1 which is the ergodicity condition of the M/M/1 queue. Note that 
fih, (3h, 1 — /i(/3 + /i), 1 — f3h stand for p, q, r, a with the notations of Proposition 2 and that 
p > q, a = Pfi{0,0) G (0, 1) and a + g — 1 = 0. Therefore, for any < /i < l/(/3 + /x), P^ is 
V^- geometrically ergodic with 7 = v/V/^ ^^'^ 

pV,{Ph) = l-h{^-y^f. 

Note that pv~{Ph) is decreasing as h growth to l/(/3 + p). The minimum (not attained) 
2^/pq = 2hyJ p^P = 2\J pf3/{p + /3) would be obtained as h := 1 / {(3 -\- p) but in this case r := 
and we retrieve a binary random walk (and its convergence rate py^ = 2^ pf3 ) corresponding to 
the embedded Markov chain associated with the birth- and- death Markov process. This last value 
was found to be the L,^ -convergence rate in [LT96] using the monotone structure of the Markov 
chain and to be the l/^ -convergence rate in [RTOl] from the equality of the two rates using the 
reversibility of the model. Finally, note that exponential bounds for {Xt)t>Q are easily derived 
using that its semi-group {Pt)t>o satisfies Pt = ex.p(Qt) = ^^Lo Ph'^ ^^vi~t/h){t/h)^ /kl and 
that Ph has the same invariant probability measure than (Xt)t>o for any h < l/(/3 + p). 
Indeed, we obtain that for any < p < {y/p — v^)^- 

V/ G \\Ptf - 7r{f)\\v, = snp^-^^^IMzAIA = 0{eM-tp)). 

Note that (y^— y^)^ is know to be the 1l?'{'k) -spectral gap for such a process (see e.g. [KarOO]). 



4 F-geometrical ergodicity of iterated function systems 

In this section we assume that (X, d) is a Polish space equipped with its borel cr-algebra X. 
Let (V, V) be a measurable space. Let us first recall the definition of an iterated function 
system (IFS) of Lipschitz maps (see [DF99, Duf97]). 

Definition 2 (IFS of Lipschitz maps) Let (i?„,)n>i be a sequence of Y -valued i.i.d. random 
variables, with common distribution denoted by v. Let be a ^-valued r.v. which is assumed 
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to he independent of the sequence {'&n)n>i- Finally, let F : (V x X, V (8> A')— 7'(X, A") he 
jointly measurahle and Lipschitz continuous with respect to the second variable. The associated 
iterated function system (IFS) is the sequence of random variables {Xn)n£N which, given Xq, 
is recursively defined by: 

Vn>l, Xn:=F{^n,Xn-i). (42) 



(43) 



Clearly (X„)„gN is a Markov chain, with transition kernel P: 

yx £X, yA£ X, P{x,A) =E[1a{F{^i,x))]= [ 1a{F{v,x)) du{v). 

Let Xq G X be fixed. For any b G [0, +oo), we set 

Vx G X, p{x) := 1 + d{x,XQ) and Vf,(x) :=p{x)^. 

We simply denote by {Bb, \ ■ \b) the weighted-supremum Banach space By^ associated with 
Vb{-), that is 

:= (/ : X^C measurable such that |/|b := sup ^-^j^ < oo j . (44) 
If tp : (X, d) — 7'(X, d) is a Lipschitz continuous function, we define 

m :=.up{ ''''^^g;^fa» .(..,)gX^.^,}. (45) 

Let a G [1, +oo). We denote by Ca the following space: 

C.:={f:li^C: := { ;^^_|M_iM_ , , . ^ ,} < o.} . 

(46) 

Such Lipschitz-weighted spaces have been introduced in [LP83] to obtain quasi-compactness 
of Lipschitz kernels, see also [MR89, Duf97, Ben98, HHOl]. 

Note that, for / G we have for all x G X: |/(x)| < |/(xo)| + 2""^ ma(/)p(x)". Thus: 

I f(x)\ 

yfeCa, l/U :=sup <oo and Ca C Ba- 

Ca equipped with the norm ||/||a := ^a{f) + I /la is a Banach space. 

In Subsection 4.1 we give standard contraction/moment conditions, called (Ca), for P to 
have a geometric rate of convergence on Ca- In Subsection 4.3 the passage to the V^-geometric 
ergodicity is investigated. As already mentioned in Introduction, this section is close to [Wu04, 
Sect. 7.2-8]. Under Conditions (Ca), an IFS satisfies the contractive property [Wu04, (7.2)] 
with respect to the Wassertein distance introduced by Wu. But his topological hypothesis on 
P is replaced by our more general compactness assumption on P^ : Bo^Ba (for some £ > 1). 
The bounds obtained in Corollary 6 on r^ggi^P) and PYa{,F) (for P acting on Ba) are the same 
as in [Wu04]. The rates of convergence with explicit constants, obtained in Subsection 4.2 
for discrete Lindley's random walks and in Subsection 4.4 for autoregressive models, are not 
reported in [Wu04]. 
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4.1 Basic inequalities for IFS 



For all X E X, G V and . . . , G V" (n G N*), define: 

F^x:=F{v,x) and := L(F^) (47a) 

Fv„:vi Fv^°---° Fvi and : vi) := L{F^^.,^^). (47b) 

By hypothesis we have L{v) < oo, and so L{vn '■ vi) < oo. Note that, for each a > 1, the limit 

ka:= lim E[L(F^„,^J"]^ 

n — > +00 

exists in [0, +oo], since the sequence (E[L(??„ : '(?i)"])„gN» is submultiplicative. Let us consider 
the following classical moment /contraction conditions: 

Conditions (Ca). For some a G [l,+oo); 

E[d(F^,xo,xo)"] < oo (48a) 

Ka<l. (48b) 

Proposition 3 (see [Duf97, Ben98]) Under Conditions (C a), there exists a unique P -invariant 
distribution, denoted by tt, on (X,X), and we have TT{d{xQ, ■)'^) < oo. 



Proposition 4 ([Duf97]) Under Conditions (Ca), the transition kernel P continuously acts 
on Ca, and for any k G (/?a, 1), there exists positive constants c = and d = such that: 

Vn>l, V/G/:a, |^'"/-vr(/)lx|a <CK"ma(/) (49a) 
Vn > 1, V/ G Ca, IIP"/ - 7r(/)lx||a < c' ll/lla. (49b) 

In particular, if ki := E[L(t?i)'^] a < 1, then 

V/ G Ca, yn > 1, |P"/ - ^(/)lx|a < ci m,(/), (50) 

where the constant ci is defined by ci :=^('^-i)^||^||i( 1 + ||vr||a)" , with 

(png(x) ( (' ( ^b, < ^V" f u 1 

4:=supsup — — — - — < oo and ||7r||6 := \ p{y) dn[y) ] forb:=l,a. 

n>lxGX Va[X) \Jx J 



Properties (49a)-(49b) and (50) can be derived from the results of [Duf97, Chapter 6]. For 
convenience they are proved in Appendix D. Note that the properties (49a) and (50) do not 
provide the V^-geometric ergodicity since they are only established for f £ Ca- Indeed, in 
general the spaces Ca and Ba do not coincide, even for countable Markov chains. 



Remark 7 Under the conditions (C a) andK[L('&l)"^]a < I, the proof of (50) (see Appendix D) 
gives the following bound for the constant ^ of Proposition 4- Given any real number 6 satis- 
fying E[L(i9i)"] < 5 < 1, choose r such that 



d{x, Xo) > T 



E 



1 + L{-di) d{x, xq) + djF^^xp, Xp) 
1 + d{x, xq) 



< 6. 
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Then we have, with := E[( max(l, L(i?i)) + d{F^-^XQ, xq) )"] , 



C < 1 + 



1 



Remark 8 From Proposition 4 we deduce the following fact. Assume that Conditions (Ca) 
holds and that / : X— t-R is such that P^f G Ca for some i £ N* . Then, for any k G (kq, 1), 
we have 

Vn > £, - ^(/)lx|a = - ^(/)lx)L < CK^-'ma{P'f), (51) 

where c = is the constant of Proposition 4- If k = ki := E[L('i9i)'^]^/" < 1, then c = ci. 
Although this is not directly connected with the Va-geometric ergodicity, such estimate may he 
of interest. This is illustrated in Corollary 8. 

Example 5 (A simple example) Let {Xn)neN be the real-valued IFS 

Xo G M, Vn > 1, X„ := 

associated with F^x := vx and with a sequence (i?n)n>i of i.i.d. random variables assumed 
to be independent of Xq . This kind of multiplicative Markov models are popular in finance. 
Let us assume that the 's have a uniform probability distribution on [0, 1] . The transition 
kernel P{x,dy) of {Xn)n>o is the uniform distribution on [0,x] if x > (resp. on [x,Q] if 
X < Q). The Dirac distribution 5o at is clearly P-invariant. Finally, setting xq := and 
d{x, y) := \x — y\, we have: Va G [1, +oo) 

E[d(F^,0,0)''] = and ki := E[L(i?i)'']^ = E[??i'^]i = ^ 



a + l 



Consequently Inequality (50) is valid. If a := 1, then the constant ci in (50) is equal to 1 
since we have p[x) = 1 + |x| and vr = Jo- 
Example 6 (Autoregressive model) In this example, we prove that Inequality (50) is ful- 
filled with the (optimal) value ki := Ka- L^t (X„)„gp} be the real-valued IFS 

Xo G M, Vn > 1, Xn := aX^-i + ^n, 

associated with F^x := ax + v where a G ( — 1,1) is fixed and with a sequence {'&n)n>i of 
centered random variables. This is the so-called autoregressive model of order 1 with an 
arbitrary centered noise. We take d{x,y) := \x — y\ and xq := 0, so that L{vn ■ vi) = \a\^ 
and \FyO\ = \v\. Then, we have = \ct\ for all a G (l,+oo). In particular we have Ka = 
E[L(t?i)"]^/" < 1. Therefore, Conditions (Ca) reduce to the moment condition E[|t9i|"] < oo, 
and under this condition. Inequality (50) holds for ki := \a\. 

Next, let us check that \a\ is the minimal value of k (and ki) such that Inequality (49a) 
(and Inequality (50)) is valid. Since i?i is centered, we have: Vx G X, E[Xi | Xq = x] = 
E[ax + = ax. In other words, we have P(f) = acf) where (p is the identity function on M 
(i.e. (j){x) := x). Note that (f) is in Ca for every a > 1, and that 7r(</)) = using P(j) = acf). 
Consequently, under the condition E[|t?i|"] < oo, Inequality (49a) holds for f := (j), and we 
can deduce from P^cf) = a"'<j) ^^^^ Inequality (49a) cannot be valid with some k < \a\. 
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4.2 Application to discrete Lindley's random walk 

Recall that a Lindley random walk is defined on X := [0, +00) by X„ := max(0, Xn-i + 
^n), where {'dn)n>i is a sequence of M- valued i.i.d. random variables independent of Xq. 
The rate of convergence of Lindley's random walks with respect to || • \\v is investigated 
in [Lun97, LT96, Wu04]. More specifically, under the assumptions E[7q^] < 00 for some 
7o G (l,+oo) and E[??i] < 0, it is proved that there exists 7 G (l,7o] such that E[7''i] < 1, 
that P is F-geometrically ergodic with V{x) = 7^, and that pv{P) = £[7''^]. The constant 
Cp of (12) associated with any p G (E[7'^^], 1) is not computed in [Lun97, LT96, Wu04]. 

This subsection is devoted to the special case of discrete Lindley's random walks. More 
specifically, under the above assumption, setting := (7"')nGN) we prove that the V^- 
geometric ergodicity property (12) holds true with the optimal rate p := E[7''i] and with 
an explicit (and simple) constant Cp. This result is based on Proposition 3 and Proposition 4 
using the distance^ d{i,j) := I7* — 7-'|, (hj) G N^. 

Let Xq be a N-valued r.v. and {'dn)n>i be a sequence of i.i.d. Z-valued r.v., independent 
of Xq. Let us introduce the sequence of N-valued r.v. (X„)„gpj defined by 

Vn > 1, Xn := max(0, Xn-i + '&n)- 

The common distribution v := (z/j)jgN of the t?„'s is assumed to be such that 

370 G (1, +00), E[7o^i] = ^Ujjl<oo and = < 0. (52) 

j>o jez 

Proposition 5 Under the assumptions (52), there exists 7 G (l,7o] such that 

Ki :=E[7'^i] < 1, 

and {Xn)n>o is V-y- geometrically ergodic with := (7")„£p^. More precisely, we have the 
following properties: 

V/ G Bv-,, Vn > 1, Vi G N, |(P"/)(i) - 7r(/)| < ciA^i"mi(/)y (53a) 

V(i,j) gN2, Vn> 1, \nXn =j\XQ = i]- 7r(l|,|)| < J^jj^ ki" (53b) 

with ci := 7r(V^). Moreover, Inequality (12) is fulfilled with p := ki and Cp := 7r(V^)(7 + 
l)/(7-l). 

Proof. The first assertion holds since ^(7) := E[7'^i] satisfies G(l) = 1 and G'(l) = E[t?i] < 0. 
To prove (53a)-(53b), we apply Proposition 4 with the distance^ 

V(i,j)GN2, d(i,j) := |y-7^|. (54) 

^Such a distance is used in [Wu04, p. 296] to compute the rate of convergence of Lindley's random walks, 
but with no explicit constants. 

*The fact that, in Proposition 5, the geometrical ergodicity is directly deduced from Proposition 4 is very 
particular. This is due to the choice of the distance in (54). 
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Note that we have with xq = 0: Vi € N, p{i) := 1 + d{i,0) = 7*. Thus the space By^ 
corresponds to Bi in (44). Next observe that the spaces Bi and Ci coincide. Indeed, for all 
/ = {f{n))nm such that |/|i := sup„gi^ |/(?^)|/7" < c», we have (use sup„gp^*(7'^ + l)/(7" - 
l) = (7+l)/(7-l)) 

mi(/) := sup {^^f4— TT' (^'^■) ^ ^''^ ^ 4 ^ ^ l/li- (55) 
I |7 7 I J 7 ~ 1 

Next we have: V(u, i) G Zx N, i^yi := max(0, i+w). Let us compute the Lipschitz (random) 
coefficient L{^i) with respect to the distance d{i,j) := I7* — 7-'|. We obtain for G 
such that i < j and for all v £ Z: 

(a) d(^F^i, F^j^ = 7^ [7* — 7-' | when z + > and j + v > 0; 

(b) d(F^z, F^j) = |l - 7^'+'' I = 7'' I7"'' - 7^' I when i + v < and j + v > 0; 

(c) d(^Fyi, F^j^ = when i + v < and j + w < 0. 

In Case (b), we have i < —v < j, thus \'y~'" — < I7* — 7-'|- Thus 

d{F,i,F,j) ^ 
L{v) := sup rp- = 7 . 

(ij)eN2,j^j |7*-7^I 

Finally, we obtain E[d(F^^O,0)] = E[|7'^^^(0'''i) - 1|] < E[-f^^]. Thus Conditions (52) implies 
that Conditions (Ci) holds with E[L(i?i)] = E[7'^i] < 1. Consequently, P has an invariant 
distribution vr such that 7r(V^) < 00 from Proposition 3. Then, Property (53a) follows from 
(50) with a := 1 (note that ci = J^Piy) ^'^{y)). To obtain (53b), use the fact that mi(l|j}) = 
(7-1)-^^"^'. □ 

Example 7 (Simulation of a geometric distribution) The Markov kernel P on X := N 
defined for p £ (0, 1) by 

P(0,0) := l-p/2, P(0,l):=p/2 
ViGN*, P(i,i - 1) := 1/2, P(i,z) := (1 -p)/2, P(i, i + 1) := p/2. 

arises from a Hastings-Metropolis sampler of a geometric distribution with parameter p. In 
[MT96, Example 2], P is shown to be V -geometrically ergodic with V = {p~"'^'^)n>o o^i^d to have 
a convergence rate satisfying pv{P) < Po := ^/p+ (1 — p)/2- More specifically: Property (12) 
holds for any p G (pO) 1) with some constant Cp such that limp_j>pyCp = +00. Proposition 5 
allows us to improve this result. 

Indeed, P can also be viewed as the Markov kernel of the discrete Lindley random walk 
where ??i is distributed as: P(t?i = -1) = 1/2, P(??i = 0) = (1 -p)/2 and P(i9i = 1) = p/2. 
Consequently the estimates (53a)-(53b) are valid for this kernel, with here 7 := p^^/^ and 
Hi := E[7'^i] = pq. In particular, Property (12) holds true with p := pQ and Cp^ := (1 + 
^)^/ (1 — ^/p), namely we have for all f = {f{n))n£n such that \ f\i := sup„ |/(n)| p""/^ < oo.- 

Vn>l, V^GN, p'/'\PV{i)-7rif)\< ^\'^_^' \f\i {Vp+^J ■ 
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4.3 K-Geometrical ergodicity of IFS 

Recall that we have set: \/x E X, Va{x) := p{x)'^. Let be an IFS. Under Condi- 

tions {Ca)^ Alsmeyer proved that, when (X„)„,gN is Harris recurrent and the support of vr has 
a non-empty interior, (X„)„,gN is V^-geometrically ergodic, see [Als03, Prop. 5.2]. In Corol- 
lary 6 below, we prove that, under Conditions (Cq), {Xn)n^n is I4-geometrically ergodic with 
a convergence rate such that pVai^) — '^ai provided that : Bo^Ba (for some £ > 1) is 
compact. The same result was proved in [Wu04, Prop. 7.2] under the following alternative 
hypotheses in place of our compactness assumption: P and P^ for some > 1 are Feller 
and strongly Feller respectively. 

First observe that, under Conditions (Ca), Property (49a) with f ■= Va and n := 1 gives 
PVa < £,iVa for some G (0, -|-oo), and so P continuously acts on Ba- Second P fulfills 
Condition (WD) with the function Va- Indeed, let 6 and k be such that < k < 5 < 1. 
Then there exists G N* such that c ma{Va) < , where c = is defined in (49a). Then 
Property (49a) applied to / := gives: P^Va < Va + 7T{Va) . Since 6 is arbitrarily close 
to Ka, the real number (5v^(P) associated with Va via the definition (3) satisfies: 

SvAP)<^a. (57) 

Corollary 6 Let us assume that Conditions (Ca) hold true and that P^ : Bq^ Ba (for some 
i > 1) is compact. Then P is Va- geometrically ergodic, and we have 

VessiP) < SvAP) < and pvSP) < ^a- (58) 

Proof. The fact that P is a power bounded quasi-compact operator on Ba with r^ssiP) ^ 
6va{P) ^ follows from (57) and Theorem 2. From this property, we deduce that the adjoint 
operator P* of P is quasi-compact on the dual space B'a of Ba and r^ssiP*) < ress(-P).^ To 
avoid confusion, we denote by P^£^ the restriction of P on Ca- From Proposition 4 we know 
that P\Ca is a bounded linear operator on Ca- Let us prove that P is VJj- geometrical ergodic 
from Conditions (b) in Theorem 3. 

Let ro G (/?„, 1). We show that A = 1 is the only eigenvalue of P on Ba such that tq < |A| < 
1. Let A G C be such an eigenvalue. Then A is also an eigenvalue of P* since P and P* have 
the same spectrum and ress{P*) < fess{P) ^ < Thus there exists f £ B'a such that 
f'oP = Xf'. But /' is also in since we have: V/ G £a, \{f'J)\ < Wf'h'Jfla < Wf'h'JfWa- 
This proves that A is an eigenvalue of the adjoint of P\Ca- Hence A is a spectral value of P\Ca- 
More precisely A is an eigenvalue of P\Ca since, from (49b), is quasi-compact on Ca and 
ressiP\Ca) — < ^ \M- Finally we have A = 1. Indeed, if A 7^ 1, then any f £ Ca 
satisfying Pf = Xf is such that 7r(/) = 0, thus / = from (49b) (pick k G (/?a,ro)). 

Now we prove that 1 is a simple eigenvalue of P on Ba- Using the previous property 
and the fact that P is power bounded and quasi-compact on Ba, we know that P"— t-H in 
operator norm on Ba, where H is the finite rank eigen-projection on Ker(P — /) = Ker(P — J)^. 
Set m = dimKer(P — /). From [Wu04, Prop. 4.6] (see also [Her08, Th. 1]), there exist m 

^Use Inequality (6), the fact that an operator and its adjoint have the same operator norms, and finally 
the fact that the adjoint of a compact operator is compact. 
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linearly independent nonnegative functions fi,...,fm G Ker(P— /) and probability measures 
/ii, . . . , /im e Ker(P* - /) satisfying /ife(K) < oo such that: V/ G Ba, n/ = Y1T=1 f^kif) fk- 
That 1 is a simple eigenvalue of P on Ba then follows from Proposition 3. 

From Theorem 3 and the previous results, for any G (k^, 1) we have pVa{P) — ^0- Thus 
we have pVa{P) l^^a- D 

Remark 9 In simple examples (as in Example 6), the optimal rate in (49a) is equal to Ka- 
In this case, we have pVa{P) — since Ca C Ba- 

Remark 10 The second inequality in (58) means that, for any real number k £ (Ra, 1), there 
exists a constant 6=6^ such that 

Vn>l, V/Gfi„ |P"/-7r(/)lx|a<eK"|/U. (59) 

Unfortunately the previous approach does not give any information on the constant e of (59). 
Inequality (51) is more precise hut in general is only valid for a smaller class of functions f. 

4.4 Applications to autoregressive models 

Assume that X := M'^ and denote the Lebesgue measure on M"^ by dy. Let || • || denote any norm 
ofM'^, and define o?(a;, y) := — y|| the associated distance on M'^. Set p{x) := 1 + ||2;|| {xq := 0) 
and let us consider Va{x) := (1 + ||x||)" with a G [1, +oo). We have lim||j.|| Va{x) = +oo. 
We know from Remark 3 that any Markov kernel P{x, dy) = K{x, y) dy, associated with a 
continuous (in the first variable) function K : M.'' x M'' — )-[0, +cxd), is compact from Bq to 
Ba- This fact allows us to apply Corollary 6 to some classical IFSs. As an illustration, 
Properties (51) and (58) are detailed below for affine autoregressive (AR) models. Such 
applications can be easily extended for others IFSs, as for instance for functional autoregressive 
models and AR processes with ARCH errors (see Examples 9-10). 

Let {Xn)nGN be the IFS 

Xo G Vn > 1, Xn ■-= AXn^i + (60) 

associated with F{v,x) := Ax + v where A = (aij) is a fixed real q x g-matrix. We have 
L{v) = \\A\\ where ||^|| denotes the induced norm of A corresponding to || • ||, and d(-F„0,0) = 
||?;||. Consequently, Conditions (Ca) hold for a G [l,-|-oo) provided that we have: 

Pll < 1 and E[||i9if] < oo. (61) 

Under these conditions, P has an invariant probability measure from Proposition 3, and we 
can easily prove that limsup||^|| _^_|„j^ PT4(x)/V^(x) < ||^||". Thus 

K„ = \\A\\ =E[L(i?i)'^]^ and 6vAP) < Pf- (62) 
The following result was already proved in [Wu04, Sect. 8]. 



32 



Corollary 7 Assume that Conditions (61) hold true for some a € [1, +00) and that the 
common distribution of {'dn)n>i is absolutely continuous with respect to the Lebesgue measure 
on W. Then P is Va- geometrically ergodic and if r{A) denotes the spectral radius of A, then: 

ress{P)<r{AT and pvAP)=r{A). 



Proof. Under the assumptions of the corollary, if P^ is compact from Bq to Ba, then we can 
deduced from Corollary 6 and (62) that P is V^-geometrically ergodic and 

ressiP) < W and pv^P) < 

Let us check that P^ is compact from Bq to Ba- Let i^(-) denote the density of 1)1. Note 
that P has the form (11) with the Lebesgue measure on W and K{x,y) := v[y — Ax) for 
(x, y) G M'^ X M^. If ^'(•) is continuous, then the desired property follows from Remark 3. In 
the general case we can proceed as follows. Let f £ Bq such that ||/||o < 1- Then we have 

V(2;,2;') G X M«, \{Pf){x') - {Pf){x)\ < [ \i^{y - A{x' - x)) - i^{y)\ dy. 

Since t — t) is continuous from to the Lebesgue space L^(IR'?), it follows that P is 

strongly Feller. Then Lemma 3 gives the desired property. 

Next, since r^ssiP) ^ ll^ll*^ and pVa{P) — ll^ll whatever the norm, we obtain ress(P) < 
r{A)'^ and pVa{P) — ''"i^) using the fact that, for any e > 0, there is an induced norm || • 
such that r{A) < \\A\\i; < r{A) + e. The proof is complete provided that pVa{P) — ^(^)- This 
follows from an easy computation based on the affine recursion (60), see [Wu04, top of p 301]. 

□ 

That the model is V^-geometrically ergodic under Conditions (61) is well-known. However, 
to the best of our knowledge, the rates of convergence obtained in the next statement are 
new. Assertions (i) and (ii) below require stronger hypotheses than in Corollary 7 and 
[Wu04, Prop. 8.3], but they give rates of convergence with explicit constants. Recall that 
the total variation distance between two probability measures pi and p2 on X is defined by 
11/^1 — /^2||tv = s^Pb&x If^iiP) ~ ^2(-B)|. The gradient is denoted by V. 



Corollary 8 Assume that the assumptions of Corollary 7 are fulfilled, that the density i/(-) 
o/i?! is continuously differentiahle on , and that there exist some positive constants (3 and 
b such that 

yv^R", llv-WII < (TTW' ^^^^ 

Then the following assertions hold true: 



(i) If (3 > q + ^ for some 7 G [0, a — 1], then for each f G S-y, we have Pf G Ca and 

Vn > 2, Vx G |E,[/(X„)] - ^(/)| < ci df Pf-^ (1 + ||x||)^ (64) 

where df := ma{Pf) and ci is the constant of Proposition 4- 
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(ii) If f3 > q and if the initial distribution fi is such that := /(I + ||x||)"(i;u(2;) < oo, then: 

Vn>2, WMn^-)-<-)\\TV <cidoI^,\\Ar~^ (65) 

where ci is the constant of Proposition 4, and the constant do can be easily expressed in 
function of the matrix A and the derivative ofv (in link with the norm \\-\\)- For instance, 
if II • II is the supremum norm on then: d^ := max^ Yli=i \^ik\) f^q 11^^(^)11 dy. 

Proof. Recall that P[x,dy) = K{x,y)dy with K{x,y) := i'[y — Ax^ so that the partial 
derivative of K in the direction x satisfies: dxK{x,y) = —A*Vu(^y — Ax^ where A* is the 
adjoint matrix of A. 

Assertion (i) holds from Remark 8, if we prove that P{B^) C Ca- This is deduced from 
Proposition E.l in Appendix E if we check Conditions (76)-(77). From (63) it can be easily 
seen that, for any r > 0, there exists a constant such that we have for all rc G satisfying 
||a;|| < r: Vy G \\dxK{x,y)\\ < (1 + ||y||)~^. Since /3 - j > q, Condition (76) holds. 

Next, set J{x,y) := (1 + ||y||)^ \\dxK{x,y)\\ for y G M?. We have 

[ J{x,y)dy = [ {l + \\Ax + v\\y\\A*Vi^{v)\\dv 

<C{{l + \\x\\y [ \\Vi^{v)\\dv+ [ WvW^ \\Vi^iv)\\dv) 
Jri Jri 

< C"(l + ||x||)^ (66) 
which proves Condition (77). Thus Proposition E.l gives P{B-y) C Ca- 

Under the assumptions of (ii)^ setting ds '■= dig, we deduce from (i) (case 7 = 0) that: 

V5 G Af, Vn > 2, Vx G |P^.(X„ G - ^(5)| < ci (i^ p||"-^ (1 + ||x||)''. (67) 

Assuming (for simplicity) that || • || is the supremum norm on M'?, it follows from (78) that we 
have for all B £ X: 

VxGM'^, ||V(P1b)(x)|| < (maxVloifcl) / \\Vv{y - Ax)\\dy. 

^ ~y Jri 

Thus ds '■= maiPls) ^ do with do given in Corollary 8, and (65) easily follows from (67). □ 



Remark 11 Under the conditions of Assertion (i), an upper bound of the constant df :- 
fna{Pf) for f £ can be easily derived (see Appendix E for details): 



df ■.= ma{Pf) < g|/|-y6C^,/3 (maxV] lajfcl) 

k ^ — ' 

with C^j3 := sup(l + x ) ^ / dy < oo. 

xeRi Jri {^ + \\y\\r 
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Remark 12 The constant ci in (64)-(65) is that of Proposition 4- Let us give an upper bound 
of ci under Conditions (61). Set 



M:=E[||^ir]V-, eo:=- 



\A\\ ( 1 + M - gp 
and r := max (J, 



2 V £0 

Recall that xq := here. Then we have for any x ^Mfl such that \\x\\ > r 



E 



1 + L{'di) d{x,xo) + d{F^^XQ,xo) 
1 + d{x,xo) 



E 



1 + \\A\\ \\x\\ + 
1 + \\x\\ 



< 



1 + ll^ll llxll 



lal 1/a 



1 + llxl 



1 + r 



„ ,„ 1 + M 1 + ll^ll 



1 + r - 2 

Setp:= ((1+ P||)/2)", :=E[(P|| + and ^ := 1 + + r)^ {1 - p) . Recall that 

||7r||b := ( Jjj(l + (i7r(j/))^^^ /or 6 := l,a. Then we have from Proposition 4 and Remark 7 

ci<i^ ||vr||i(l + ||7r|Ur-i. 



Example 8 (Contracting normals) Assume that X := R and that P{x,dy) is the Gaus- 
sian distribution M{Ox,l — O"^) for 9 G (—1,1). This kernel is studied in [Bax05, Exam- 
ple 8.3], in the specific case 9 := 1/2 in [KolOO] and the convergence of its ergodic averages 
is discussed in [RT99, Example 4]- Note that P is the transition kernel of the IFS defined 
by Vn > 1, Xn := 9X^-1 + '&n, where {'dn)n>i is a sequence of M-valued i.i.d. random 
variables, with common distribution J\f {0,1 — 9"^). It can be easily checked that P has the 
standard Gaussian distribution M{0, 1) as invariant probability measure vr. Here we have 
Ba := {/ : M^C, sup.^K |/(x)|/(l + \x\Y < oo}. 

Let a G [l,+oo). Since Conditions (61) hold, we have r^ssiP) ^ I^T with P considered 
as an operator on Ba- Concerning the geometric ergodicity. Corollary 7 ensures that the 
convergence rate of P on Ba satisfies pVa{P) — 1^1 Z'^'" any a G [l,+oo) (also use Example 6 
to obtain the last equality). This improves all the earlier bounds obtained for pVaiL') ^^^■^ 
example (compare with [BaxOS] in case a := 2). 

Furthermore, for this example. Property (65) enables us to improve and simplify the results 
of [RT99, Section 5] concerning the total variation convergence bounds. In fact, for any initial 
distribution p such that I^ := /(I + \x\)dp{x) < oo. Inequality (65) with a := 1 gives (use 
ci := 1 + v^T^, do := 2\9\/y/27T{l-9^)): 



Vn > 2, II^P" - ttWtv < 



In 



Finally Assertion (i) of Corollary 8 provides an interesting alternative result between the 
last one and the Va- geometrical ergodicity. For instance, in case a := 2, Property (64) ensures 
that, for all f £ Bi, we have Pf G £2 and: 



Vn > 2, Vx G M", |E,[/(X„)] - 7r(/)| < cdj l^]" (1 + l^^l)", 



(68) 
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wtth c := 2 (1 + (1 + V2 + 2^/2/ )/\e\ (first use that P^W2{x) = 2(1 — 0^") + 

0'^^W2{x) for n > 1 mi/i ^2(3^) = 1 + 2;^, so i/iai sup„>i sup^.^^ P'^W2{x) /W2{x) = 2; second 
deduce from V2/2 < W2 < V2 that ^ < 4; third check that \\7r\\i = 1 + ^J^j-n , ||7r||2^ := 
2(1 + sJII'k') ). Note that (68) does not involve the Va- geometrical ergodicity (either on Bi, 
or on 82)- However the rate of convergence in (68) is optimal and the associated constant C 
is explicit. The weighted-Lipschitz constant df := m2{Pf) can he easily computed thanks to 
Remark 11. 

The two last examples are classical extensions of the affine ARs. 

Example 9 (The functional autoregressive process) Let {Xn)nen be the IPS 

Xo G R", Vn > 1, Xn := ^(X„„i) + (69) 

associated with F{v,x) := ip{x) + v where ^/^ : M"? — t- M"^ is a fixed differentiable function and 
"i?! has a density z^(-). Assume that 

Q := sup ||V''(x)|| < 1 and 3a G (1, +00), E[||t?if ] < 00. (70) 

Then Corollary 7 extends to the IPS model (69) as follows: ress(P) < a'^ and pVa{P) — ^■ 
A direct adaptation of the above arguments allows us to prove that, if the density i/(-) o/i?! 
satisfies (63), then all the conclusions (i) and (ii) of Corollary 8 hold true with again a in 
place of \\A\\ . 

Example 10 (Autoregressive process with ARCH(l) error) Let {Xn)neN be the real- 
valued IPS 

Xo G M, Vn > 1, Xn := aX^-i + a{Xn-i)'&n, (71) 

where a : M— >M is defined by a{x) := Vb + cx^ with fixed a,b > 0. It can be easily seen 
that the associated Markov kernel is of the form P{x, dy) = K(x, y)dy, provided that the 
probability distribution 0/ ??i has a density v{ ). Corollaries 7 and 8 can be adapted under 
suitable assumptions on v{-). 
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A Positive eigenvectors of the adjoint of a nonnegative operator 
on By 

Proposition A.l If L is a positive bounded linear operator on By such that r{L) = 1, then 
there exists a nontrivial nonnegative continuous linear form rj on B such that rj o L = rj. 

Proof. Since = 1 and the spectrum cr{L) of L is closed, there exists A G cr{L) such that 
|A| = 1. Set Xn '■= A(l + 1/n). From the Banach-Steinhaus theorem, there exists /o G By, 
fo ^ 0) such that ||(A„/ — L)~-'^/o||y +oo when n — )• +oo. Using the Neumann series, namely 

\z\>l ^ = ^z-(^•+l)L^ 

k>0 

the positivity of L (which gives \L^fo\ < L^\fo\): ^-nd finally the fact that By is a Banach 
lattice (V(/,5r) G By : \ f\ < \g\ \\f\\v = \\ \ f\ \\y < \\ \g\ \\v = \\g\\y), we obtain: 

||(A„/-L)-Vo||y < ||(|A„|/-L)-Vo| \\v ^+oo as n^+oo. 

This implies that 1 G cr{L). Now, let By denote the dual space of By with the associated 
norm also denoted by || • and let L* be the adjoint of L. Since cr(L*) = o'{L), we have 
1 G a{L*). 

Let us set /3,i := 1 + 1/n. We deduce from the Banach-Steinhaus theorem that there exists 
G B'y, f'^ > 0, (i.e. V/ G 5y : / > f'^{f) > 0) such that 

K--=\\{l3nI - L*)~'^ f'^Wy^+oo asn-^+oo, 

where / denotes here the identity map on B'y. Let us define the following positive elements 
in the unit ball of B'y: 

f'n ■■= yi^nl- L*r'fo = y E/3n^'+'^ (^ G N*). 

We have > 0, thus H/^Hv = fni^) ~ ^- Thanks to the Banach- Alaoglu theorem, the 
sequence {fn)n has a limit point, say r], in the unit ball of B'y for the weak topology in B'y, 
that is: for all neighborhood W of r] and for all N > 1, there exists n > N such that G W. 
Now, given / G By, consider the following special neighborhoods of rj with respect to the 
weak topology of B'y: 

W{f,p) = |/' G B'y : If'if) - r?(/)| < ^, \f'{Lf) - viLf)\ < i| . 

Let us denote by {np)p = {np{f))p any increasing sequence of integer numbers such that we 
have G W{f,p) for all p > 1. First, pick any / G By, / > 0: then it follows from 
f'n,U) > and \f^f) - rj{f)\ < 1/p that r/(/) > 0. So > 0. Second, consider / := V: then 
we obtain r](y) = 1 from fn^iV) = 1 and |/4p(^) — ^(^)| < so ry / 0. Finally, let us fix 
any / G By- We have 

MfL-^l)if)-{fL-rim) = {j3^I-L*){f'^){f)-{P^I-L*){r,){f) 

= ^/^(/)-(/3„/-L*)(^)(/). 
Replacing n with Up = np{f) gives (/ — L*){rj){f) = as p— ?• -|-oo. Namely: rj o L = r]. □ 
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B Proof of Lemma 3 and Lemma 4 



Proof of Lemma 3. Obviously, we may assume £ := 1. Let (/n)neN £ •^o such that ||/n||o ^ 1- 
From the Banach-Alaoglu theorem, there exist a subsequence (/nj,)fcgN and / G h°°{r]) such 
that 

VffGL^(r/), lim J fn^gdr] = J fgdrj. 

Since K{x, •) G IL'"'^('7) for all x G X, we obtain: \/x G X, limfc(P/„^J(2;) = {Pf){x). Define 
VA; > 1, Vx G X, Afc(x) := sup |(P/„,J(x) - (P/„J(x)|. 

The sequence (PAfc)fc>i is non increasing from A^+i < and the positivity of P. Next, since 
^fc~^0 (pointwise) and ||Afc||oo < 2, we have PA^ \ (pointwise) from Lebesgue's theorem. 
Note that each PA^ is continuous from the strong Feller assumption. Then we deduce from the 
monotone Dini theorem that the sequence (PAyt)fc>i uniformly converges to on each compact 
of X. Now let e > 0. Then there exists A > Q such that d(x,xo) > A ^ V{x)~^ < e/2, 
and there exists ng G N such that: V/c > no, sup^^^, .j,g)<^ |(PAfc)(x)| < £. Since V > 1 and 
||PAfc||o < 2, we obtain for every k > uq: 

llPA,ll„ = ,,pK^M£M<„,„( I(PA,,WI, .„p K^M£i)<, 

xGX y{X) \d{x,xo)<A d{x,xo)>A ^[x) J 

Finally observe that we have for all p,q > k 

\P^fn, - P'/nJ < P(|P/„, - P/nJ) < PAfc, 

therefore we have: \/p,q > no, \\P'^fnq — P'^fnpWv ^ ll-f^nollv < We have proved that the 
sequence {P'^ fn^)k>i is Cauchy in By- Hence it converges in By- D 

Proof of Lemma 4- We know from (WD) that P is power-bounded on By- Let xq G X. Then 
we have K := sup„(P"'y)(xo) < oo. Let TTn, n > 1, be the probability measure on (X, ^Y) 
defined by: VP G X, 7r„(lB) = ^ J2k=o(P'''^B){xo)- Then Markov's inequality gives 

vr (V) K 

Vn > 1, Va G (0, +oo), 7r„(lry>^|) < < — . 

Thus the sequence ('7r„)„>i is tight, and we can select a subsequence {'Knk)k&n weakly con- 
verging to a probability measure vr, which is clearly P- invariant. Next, for p G N*, define 
Vp{-) = mm{V{-),p). Then V/c > 0, Vp > 0, 7r„^(yp) < -Kn^iV) < K. Since Vp is contin- 
uous and bounded on X, we obtain: Vp > 0, lim^ 7r„j,(V^) = vr(V^) < K. The monotone 
convergence theorem then gives TriV) < oo. □ 



C Additional material for discrete Markov chains 
C.l Complements for the proof of Corollary 5 

Lemma 7 Under Conditions (I) -{A), 1 is a simple eigenvalue and the unique eigenvalue of 
modulus one of P on By. 
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Proof. First prove that the support of vr coincides with N. We have 

Vi G N, Vn > 1, 7T{j) := 7r(P"l|,|) = ^7T{i) P^{i,j). 

i>0 

Hence, if 7r(j) = for some j G N, then we would obtain 7r(i) = whenever P^{i,j) > 0, thus 
-7r(i) = for alH E N from Condition (X), which is impossible. 

Second, we have the following implication: VA G C, |A| = 1, V/ G By, 

Pf = Xf P\f\ = \f\. (72) 

Indeed, we deduce that |/| < P\f \ from Pf = Xf and the positivity of P. Then it follows 
from 7r(P|/| - |/|) = that P\f \ = \ f\ vr-a.s. Thus P\f \ = \ f\ since the support of vr is N. 

Third, we prove that 1 is a simple eigenvalue of P on By- Let g G By such that Pg = g, 
and set f := g - g{0)ln. Then Pf = f so that P\f \ = \f\. We have: Vn > 1, = |/(0)| = 
^j>qP"(0,j) Condition (I) then yields / = 0, namely g is constant. 

Finally, let A G C, |A| = 1, and let f e By, f ^ 0, be such that Pf = Xf. It follows 
from (72) and the last statement that, Vn G N, |/(n)| = 1 (up to a multiplicative constant). 
From |A| = 1, |/| = 1, and Vn > 1, A"/(0) = Ej>o ^"(0,i) /(i), we obtaim P"(0,j) > 

^ A"/(0) = /(j). In particular: n G 7^o,o A"/(0) = /(O). This gives: V(?n,n) G 
■^0,0 X 7^o,o, A"""" = 1, hence A = 1 by Condition {A). □ 

C.2 Random walks with bounded increments 
C.2.1 Proof that the integer i in (24) is well-defined 

Set ^4(7) := 0(7)7^ = X^fcLo ^-6+^7'^ where </'(7) is defined in (23b) and uq 7^ 1. The integer 

1 in (24), if well-defined, can be equivalently characterized from Leibniz's formula by 

k e-i 
yk €{!,...,£-!}, #)(!)= ^W(l)-J](6-i)=0 and </,W(l) = AW(l)-J](6-j) / 0, 

j=0 j=0 
according that the first condition is removed when i = 1. To prove the existence of such an 
integer i, observe that, if ^('^^(1) = Y['jZoi^~3) k = 1, ... ,2b, then Taylor's formula would 
give 

^(7) = ^(1) + E ( n(^ - J)) = 1 + E (t) - = 

k=l ^j=o ^ ' k=\ ^ ^ 

which is impossible since oq 7^ 1- 

C.2.2 Condition (WD) for random walks with i.d. bounded increments 

Let P be defined on X = N by 
ViG{0,...,6-l}, E^(^'^') 

j>0 



1; Vi>6,VjGN, P{i,j) 



if 
if 



< b 
> b 
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where b G N*, {a^h, . . . , ab) G [0,1]^''"'"^ and Y^k=-b^k = 1- Assume that oq < 1 and that 
there exists 7 £ (1, +00) such that 

Vie{0,...,6-1}, J]P(i,j)7^' <oo. 

j>o 

The integer I in the next proposition is defined by (24), it is well-defined from Subsection C.2.1. 
Proposition 6 The following conditions are equivalent: 

(a) There exists 70 S (1,7] such that P satisfies Condition (WD) with V^^ := (7o"')neN) CLnd 
we have 

e-i 

(b) ^W(l) < \{{b-j), I.e. 0W(1) <0. 

j=0 

Proof. Let us prove the equivalence (a) <^ (6). Assume that A^^\l) > 13^=0 (^ ~ j)' 
(^W(l) > 0, and prove that, for all 70 G (IjT], P does not satisfy (WD) with V := V^q. From 
the definition of £ and from (p^^\l) > 0, there exists 72 S (1,7] such that <f>{Y) > 0(1) = 1 
for all 7' e (1,72), so that (27) gives 

Vy G (1,72), ViV > 1, limsup^^^^^^y^ = 0(7')"^ > 1. 



n — > +00 



l/y(n) 



Hence, from Corollary 4, for all 7' E (1, 72), P does not satisfy Condition (WD) with V := Vy. 
This proves the desired result. Indeed, if (WD) holds with V := V^g for some 70 € (1,7], 
then (WD) would be fulfilled for all 7' G (l,7o] from Jensen's inequality, which contradicts 
the last conclusion. 

Conversely, assume that (f>^^\l) < 0. Then there exists 71 G (1,7] such that (f>{7o) < 1 for 
all 7o E (l,7i)- Let 70 S (l,7i)- From (27) with k = 1, we obtain {PV^Q){n) = (/)(7o) V^o(n) 
for all n > b. Since by assumption we have X^j>o -P(*! i)7o < ^ each < i < 6 — 1, it 
follows from Corollary 4 that P satisfies (WD) with V := V^p and that Sv^^iP) < 0(7o)- The 
converse inequality follows from (27) and Corollary 4. □ 



D Proof of Formula (50) 

For the sake of simplicity we prove Properties (49a) and (49b) with explicit constants in 
the special case when ki := E[L(i?i)'^]^/'' < 1. Under the general assumption < 1 of 
Conditions (C^), the proof of (49a)-(49b) is similar (replace P by with N such that 
E[L{E,^.,,,r]<l). 

First, we prove that the constant ^ in Proposition 4 is well defined. Second, we obtain 
a basic estimate (75) of the distance between functional of the states occupied at time n of 
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the IFS from two different initial probability distributions. Then, we complete the proof of 
Formula (50). 



First, we have for any x G 



Va{x) 



E 



1 + d{F^^x,xo] 
1 + d{x,xo) 



< E 

< E 



1 + d{F:ff^x,F^^xo) + d {F^^ xq , xq ) ^ " 

1 + d{x,xo) 
1 + L(i?i)d(x,xo) _^ d{F^^xo,xo)Y 



l + d(x,xo) l + d{x,xo) 

Since E[(max(l, L(i?i)) + d{F/}^xo, xo))"-] < oo, we obtain := sup^izx{Pya){x)/Va{x) < oo. 
Next Lebesgue's theorem ensures that limsup(Pla)(x)/V^(x) converges to E[L(t?i)"] when 
d{x, xq) — )• +00. Now let 6 be such that E[L(i?i)"] < 6 < 1. Then there exists r > such that 
we have for all x G X satisfying (i(x,xo) > r: PV^(x) < 6Va{x). Besides, if d{x,XQ) < r, then 
we obtain {PVa){x) < £,iVa{x) < 6(1 + r^. Thus: PVa < SVa + 6(1 + r^lx- Therefore 



< 5" K + 



6(l + r)« 



Ix < 1 + 



1-6 - - V ■ i-s 
This prove that the bound ^ given in Proposition 4 is finite. 



(73) 



Second, let us introduce some additional notations. If /i is a probability measure on X and 
Xq ~ fi, we make a slight abuse of notation in writing {Xn)neN for the associated IFS. We 
simply write (X^)„gN when /j, := 6x is the Dirac mass at some x G X. We denote by .AA-a 
the set of all the probability measures on X such that ||^||a := ifx^aiu) dfj.{y))^^°' < oo. 
Finally, for n G N and for any probability measures /ii and ^2 on X, define: 

Lemma D.l We have: \fn > 1, V(^i,/i2) G Ma x -A4a 

E[A„(/.i,^2)] <C'^«?E[d(Xo^\Xo^^)](||//i|U + ||//2||a)"~' 

Furthermore we have for all f £ Ca-' 
E[\f{Xii^)-f{Xii^ 



<^'^ma{f)K^,E[diX^\X^')]{Ma + May~\ 



(74) 



(75) 



Proof. If a = 1, then (74) follows from the independence of the t?„'s and from the definition 
of L{v) and ki. Now assume that a G (l,+oo). Without loss of generality, one can suppose 
that the sequence {'0n)n>i is independent from (X^^ , X^^). Also note that, if G Ma, then 
we have 



(P"K)(x)dMx)<^||/i||^. 



From Holder's inequality (use 1 = 1/a + (a — l)/a), we obtain 



E 



d{F^„..^,X^\F^„.,^,X^') (p(Xr) +piXii^)) 



a-l 



< E[d{Xi;\Xi;^)]E[L{{}r. : ^1) {p{Xn +p(Xf ))' 

< E[diX^\X^^)]E[L{^r. ■■ ^i)"]'e[(p(X^^) +piXii-)y]^ 



< E[d{X^^ , X^' )] E [Li^iT] » (ll/ii lU + 11/^2 lla 
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This proves (74). Property (75) follows from (74) and the definition of ma{f). □ 
We can prove (50). Property (75), applied to /ii := 5x and ^2 ■= gives 

|(P'^/)(x)-7r(/)| = |E[/(X;?)]-E[/(X„-)]| 

< E[|/(X;?)-/(X-)|] 

< ^^ma{f)K^E[d{x,X^)]{\\6,\\a + Mar~^. 

Next observe that \\Sx\\a = p{x) and 

E[dix,X^)] <E[dix,xo) + d{xo,X^)] <p(x) + 7r((i(2;o,-)) <p(^)IK||i. 

UenceE[d{x,X^)]{\\5x\\a + \maT~'^ < p(x)"||7r||i (1+ ||7r||a)"~^ Property (50) (namely (49a) 
with explicit constant) is then proved. 

Finally, to prove (49b), it remains to study ma{P^ f) for / € Ca- Inequality (75) applied 
to III = 5x and /i2 = liy gives: 

V/g£„ |(P"/)(x)-(P"/)(y)| <C'^m,(/)K?d(x,2/)(p(x)+p(y))"-\ 
Thus ma{P^f) < i^aif) 1^1- Since ma(lx) = 0, this gives 

m,(P"/ - ^{f)lx) < ruaif) 
Combining the last inequality with (50) gives (49b). 



E Additional material for P defined by a kernel K 

Here {Xn)neN is a Markov chain with state space X = M'' ((/ G N*) equipped with any norm 
II • II, and we assume that there exists K : x M"^ — t-IO, +00) measurable such that, for all 
x E X, P{x, dy) is absolutely continuous with respect to the Lebesgue measure on R'^, namely: 
P{x,dy) = K{x,y)dy. 

Let a E [l,+oo) and 7 E [0,a — 1]. The next result is useful to obtain the set inclusion 
P{B-y) C jCfl) where B-y^ Ca are defined by (44) and (46) page 26. 

Proposition E.l Assume that, for all xq E M'^, there exist a Lebesgue-integrable function 
QxQ '■ M*^ — 7'[0, +00) and an open neighborhood U xo of xq in such that: 

dK 

VxEZY,o, k=l,...,q, [l + \\y\\y\—{x,y)\<gxM for a.e. y(^W (76) 
and assume in addition that there exists a constant d such that 

k = l,...,q, VxEM", [ (l + ||y||)7||^(x,y)|(iy<(i(l + ||x||)'^-i. (77) 



Then we have P{B-y) C Ca- 
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Proof. Let f G B^. From Lebesgue's theorem, one can easily deduce that the function 
Pf : R'^ — )• C is differentiable on M'^, and that its derivative is given by: 

d(Pf) r dK 

VA: = 0,...,9, VxGM'?, -\ll{x)= f{y)—{x,y)dy. (78) 



For the sake of simphcity assume that || • || is the euchdean norm on W^. By (78) and (77) 
we obtain: Vx G M"?, ||V(P/)(x)|| < d^\f\^(^l + , where V stands for the gradient 

operator. Then Taylor's inequality gives for any {xi,X2) GM'^ xW^ 



\Pf{xi)-Pf{x2)\ < ||X1-X2|| sup ||V(P/)(tel + (l-t)x2)|| 

te[o,i] 

< "^a/^ I/I7 Iki - 2;2|| sup (1 + + (1 - t)x2||)" 

te[o,i] 

< d\^\f\-y \\xi - X2\\ (1 + ||xi|| + ||3;2||)''~V 

It follows that Pf G Ca. □ 
The following statement gives a simple sufficient condition for (77) to hold true. 

Proposition E.2 Let a £ [l,+oo) and K-i(-) := (1 + II • ||)"~"^- Assume that PK-i/K-i 
is bounded, and 

\(d^K)(x,y)\ , , 

M:= sup ' 1" / I ' <oo. (79) 

(x,j/)gk9xK9 \'^\x^y]\ 

Then, for each 7 € [0, a — 1], Condition (77) is fulfilled. 

Proof Let x G M^. We have 

/ {l + \\y\\y\d^K{x,y)\dy<M [ {1 + \\y\\r-' K{x,y) dy = {PVa-i){x) < C Va-i{x) 

for some constant C. □ 

Proof of Remark 11. Assume that || • || is the supremum norm on M'^. We obtain from (78) 
with K{x,y) := v{y — Ax^ and from (63): V/ G B^, Vx G M'', 

||V(P/)(x)|| < |/|^(maxV|a,fc|) / {I + \\y\\y \\yu{y - Ax)\\dy 

< |/|^(maxV|a,fc|) / (1 + \\y + Ax\\)'^ dy 

~t (i + lly|l)'^ 



< bC^^i3 \f\^ (max^ la^fcl) (1 + ||2;||)^. 



i=l 



We easily deduce that ma{Pf) < qbC^/^jj |/|^ (niax^ Yl'^^i |a«fc|)- D 
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